Abstract


A  key  issue  to  the  U.  S.  Air  Force's  Airborne  Laser  (ABL)  program  is  the  ability  to 
accurately  predict  the  level  of  optical  turbulence  that  the  ABL  will  encounter  at  flight 
levels  in  the  upper  troposphere  and  lower  stratosphere.  The  optical  turbulence  must  be 
characterized  so  that  the  range  and  range  variation  of  the  ABL  can  be  determined. 

Gravity  wave  spectra  resulting  from  frontal  or  jet  stream  passage  are  presumed  to  cause 
layers  of  optical  turbulence;  however,  exact  relationships  between  optical  turbulence  and 
synoptic  scale  meteorological  phenomena  are  unclear. 

This  study  assesses  the  statistical  relationship  between  optical  turbulence  and  synoptic 
scale variables through multiple linear regression. The optical turbulence measurements
were made by the 50 MHz radar at White Sands Missile Range, New Mexico, during a
discontinuous period between January 1993 and January 1994. Measurements were
averaged temporally and vertically to coincide with weather data. The synoptic scale
meteorological data was extracted from the National Center for Environmental
Prediction/National Center for Atmospheric Research reanalysis database.

Results  from  the  regression  models  showed  that  a  linear  relationship  exists  between  the 
logarithm  of  optical  turbulence  and  major  synoptic  scale  variables;  however,  this 
relationship was a weak one. Based on this, it was concluded that further research was
needed  to  define  the  exact  relationship  between  synoptic  scale  meteorological  variables 
and  optical  turbulence. 
REGRESSION ANALYSIS OF RADAR MEASURED OPTICAL TURBULENCE
WITH SYNOPTIC SCALE METEOROLOGICAL VARIABLES


1.  Introduction 


1.1  Background 

Operation Desert Storm showed that U.S. armed forces have limited capability against
theater ballistic missiles. The U.S. military's current defenses are weapons such as the
Patriot, which destroy missiles near the end of their trajectory. There is no capability to
defend against missiles shortly after they have been launched, a period known as the boost
phase. To fulfill this need, the U.S. Air Force developed the Airborne Laser (ABL)
program.  This  program  will  involve  placing  a  multimegawatt  laser,  a  beam  control 
system,  and  related  equipment  on  a  Boeing  747-400  aircraft,  enabling  detection  and 
destruction  of  enemy  missiles  during  their  powered  boost  phase  of  flight  (GAO,  1997:1). 

The  concept  of  the  ABL  system  is  to  detect  an  enemy  missile  shortly  after  its  takeoff, 
track the missile's path, and direct a concentrated laser beam on the missile until the beam's
heat causes the missile's pressurized casing to fracture and explode, igniting the remaining
fuel. This explosion would cause the missile's warhead, in addition to any nuclear,
chemical or biological agents it may carry, to fall short of its intended target (GAO,
1997:2).  The  window  of  opportunity  is  only  from  the  time  the  missile  clears  the  cloud 
tops  to  its  booster  burnout,  since  the  missile  is  under  pressure  only  while  burning.  The 
window  can  range  from  30  to  140  seconds  based  on  missile  type.  ABL's  concept  is  shown 
in  Figure  1. 
Figure 1. ABL Concept (GAO, 1997:5)


1.2 Significance of the Problem

A  key  issue  to  ABL's  success  is  the  ability  to  predict  the  level  of  optical  turbulence  that 
the  ABL  is  expected  to  encounter  at  flight  levels  in  the  upper  troposphere  and  lower 
stratosphere  (GAO,  1997:2).  This  atmospheric  optical  turbulence  must  be  characterized 
so  the  range  and  range  variation  for  the  ABL  can  be  established.  Optical  turbulence,  also 
known as the refractive index structure parameter, is the single most important factor in
determining the electromagnetic wave propagation characteristics in the troposphere and
stratosphere (Warnock and VanZandt, 1985:1). The ABL, therefore, must be able to
compensate for the optical turbulence in order to maximize its range and decrease its
range variation.

Gravity  wave  spectra  resulting  from  activity  such  as  frontal  passage  or  jet  stream 
passage are presumed to cause layers of optical turbulence (Nastrom and Eaton, 1993a:1;
Bluestein, 1993:568). However, exact relationships between optical turbulence and
synoptic scale meteorological phenomena are unclear, and continuous sets of optical
turbulence measurements for extended periods of time are rare (Nastrom and Eaton,
1993b:1). If a strong statistical relationship between optical turbulence and major
synoptic features can be found, a means of forecasting optical turbulence through synoptic
scale variables can be established. This statistical relationship is related to the range and
range variation of ABL technology, which will be addressed in the program's Authority to
Proceed milestone scheduled for summer 1998.

1.3  Problem  Statement 

What  statistical  relationship  exists  between  radar  measured  values  of  optical  turbulence 
and  major  synoptic  scale  meteorological  variables? 

1.4 Benefit from Solving the Problem

A strong statistical relationship between optical turbulence, characterized by the
refractivity structure parameter C_n^2, and specific synoptic features would enable the ABL
team to establish trends of C_n^2 anywhere on the globe by deriving them from archived
synoptic scale climatology. A strong relationship would also provide the ABL team with a
guide as to where to collect C_n^2 data. As mentioned in 1.2, the ability to forecast C_n^2 plays
a major role in demonstrating the ABL's range and range variation. A weak correlation
between C_n^2 and synoptic variables would still provide useful information to the ABL
team, because it would eliminate one method of forecasting optical turbulence. In this case, the
ABL team may explore other forecasting methods, possibly based on local or
mesoscale phenomena, in order to demonstrate the range of the ABL.

1.5  Scope 

This  study  was  focused  on  determining  what  statistical  relationship  existed  between 
optical  turbulence  measurements  taken  from  the  50  Megahertz  (MHz)  radar  at  White 
Sands  Missile  Range,  New  Mexico,  and  corresponding  synoptic  scale  meteorological 
variables  extracted  from  the  National  Center  for  Environmental  Prediction/National  Center 
for  Atmospheric  Research  (NCEP/NCAR)  reanalysis  database  through  multiple  linear 
regression. Specifically, this study determined the relationship between C_n^2 and variables
associated  with  fronts,  jet  streams,  the  Bulk  Richardson  Number  and  the  ageostrophic 
wind  at  the  synoptic  scale.  Details  on  the  data  set  will  be  discussed  in  Chapter  4. 

This  study  did  not  specifically  correlate  turbulence  to  gravity  wave  activity  using 
indicators  such  as  the  variance  of  vertical  velocity  (Nastrom  and  Eaton,  1993b:82). 
However,  since  some  non-convective  gravity  waves  can  be  generated  by  frontogenetical 
processes  or  by  air  parcels  moving  through  jet  streaks,  gravity  wave  activity  was 
accounted for in this study (Bluestein, 1993:568). For example, the maximum zonal and
meridional  winds  were  included  as  potential  predictors  in  the  regression  models.  Gravity 
waves  may  be  related  to  these  winds;  therefore,  gravity  wave  activity  was  indirectly 
accounted  for. 
1.6 General Approach


This  research  consisted  of  two  main  parts:  data  processing  and  multiple  regression 
analysis  as  described  below. 

1.6.1  Data  Processing. 

The radar measured data and the climatological data from the NCEP/NCAR
reanalysis database were provided by the thesis sponsor, the Air Force Research Laboratory
(AFRL). Meteorological parameters associated with fronts, jet streams and the bulk
Richardson Number were extracted from the NCEP/NCAR reanalysis database to
correspond to the location and dates for which data was collected. Since the
climatological and C_n^2 data were not simultaneous, the C_n^2 values were averaged in six hour
increments  to  coincide  with  the  weather  data.  The  turbulence  measurements  were  also 
averaged  vertically  in  300  meter  increments  at  altitudes  corresponding  to  tropopause 
heights. 
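
The averaging step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the record layout (time, height in meters, C_n^2), the function name bin_average, the use of a plain arithmetic mean, and the alignment of the six-hour windows to 00, 06, 12 and 18 hours are all hypothetical choices, not details of the processing actually performed on the AFRL data.

```python
from collections import defaultdict
from datetime import datetime


def bin_average(records, hours=6, layer_m=300.0):
    """Average C_n^2 samples into (time window, height layer) bins.

    records: iterable of (datetime, height_m, cn2) tuples (hypothetical layout).
    Returns a dict {(window_start, layer_base_m): mean cn2}.
    """
    sums = defaultdict(lambda: [0.0, 0])
    for when, height_m, cn2 in records:
        # Start of the window containing this sample (assumed 00/06/12/18 alignment).
        window = when.replace(hour=(when.hour // hours) * hours,
                              minute=0, second=0, microsecond=0)
        # Base height of the layer containing this sample.
        layer = (height_m // layer_m) * layer_m
        acc = sums[(window, layer)]
        acc[0] += cn2
        acc[1] += 1
    return {key: total / count for key, (total, count) in sums.items()}
```

Each key of the returned dictionary pairs the start of a six-hour window with the base of a 300 meter layer, so the averaged values can be matched directly against reanalysis fields valid for the same window.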

1.6.2 Multiple Regression Analysis.

Multiple regression analysis was performed with the C_n^2 value at varying altitudes as the
dependent variable and the meteorological values as the independent variables. Multiple
regression analysis was appropriate because it measured the degree of association between
the turbulence and synoptic scale meteorological values (Tabachnick and Fidell, 1983:86).
The SAS computer statistics package was used to complete the regression analysis.
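
For readers without access to SAS, the idea of multiple linear regression can be illustrated with a minimal Python sketch. It is not the SAS procedure used in this study; it simply solves the ordinary least squares normal equations and reports the coefficient of determination. The function name fit_multiple_regression and the toy data in the usage note are hypothetical.

```python
def fit_multiple_regression(X, y):
    """Ordinary least squares fit of y = b0 + b1*x1 + ... + bk*xk.

    X: list of rows of predictor values; y: list of responses.
    Returns (coefficients with the intercept first, R^2).
    Assumes the predictors are not collinear (no singularity check).
    """
    rows = [[1.0] + list(r) for r in X]  # prepend the intercept column
    k = len(rows[0])
    # Normal equations: (A^T A) b = A^T y.
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    aty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    # Gaussian elimination with partial pivoting on the augmented matrix.
    aug = [ata[i] + [aty[i]] for i in range(k)]
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, k):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, k + 1):
                aug[r][c] -= factor * aug[col][c]
    # Back substitution.
    beta = [0.0] * k
    for i in reversed(range(k)):
        s = sum(aug[i][j] * beta[j] for j in range(i + 1, k))
        beta[i] = (aug[i][k] - s) / aug[i][i]
    # Coefficient of determination.
    mean_y = sum(y) / len(y)
    fitted = [sum(b * v for b, v in zip(beta, r)) for r in rows]
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return beta, 1.0 - ss_res / ss_tot
```

In the setting of this study, y would hold values of the logarithm of C_n^2 for one altitude and each row of X would hold the synoptic scale predictors for the same six-hour period. A low R^2 from such a fit corresponds to the weak relationship reported in the abstract.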

1.7 Thesis Organization

Chapter  2  discusses  past  and  recent  literature  related  to  the  problem  under 
investigation.  It  describes  methods  used  to  gain  an  understanding  of  small  scale 
turbulence by studying C_n^2 using measurements from wind profiler radars. It also
describes some modeling efforts and their comparisons to wind profilers. Finally, it
provides a synopsis of results expected based on the literature search.

Chapter  3  provides  an  overview  of  the  theory  of  optical  turbulence  and  how  it  relates  to 
radar.  It  also  provides  details  on  the  radar  at  White  Sands  Missile  Range,  New  Mexico 
used  to  make  the  measurements.  Lastly,  it  describes  the  data  processing  method 
which  includes  an  outline  of  the  theory  behind  some  of  the  meteorological  variables 
derived. 

Chapter  4  describes  the  NCEP/NCAR  reanalysis  database  and  the  meteorological  data 
extracted from it and the C_n^2 data observed using the White Sands Missile Range radar. It
also explains the statistical tools employed in the data analysis such as multiple linear
regression. 

Finally,  Chapter  5  presents  the  results  and  conclusions  of  this  study.  Recommendations 
for  further  research  are  also  given. 
2. Literature Review


2.1  Early  Work 

Understanding small scale turbulence enables us to understand the general circulation of
the atmosphere including effects on propagation of electromagnetic waves (Nastrom and
others, 1986:6722; Nastrom and Eaton, 1993b:81). One method to gain an understanding
of turbulence is to study its parameters, such as the refractivity turbulence structure
parameter, C_n^2. The ability to measure turbulence parameters was enhanced in the 1970s
and 1980s with use of pulsed radars, called wind profilers, capable of detecting returns
from the clear air. Measurements by balloon, aircraft or optical techniques were also
available, but usually limited to short campaigns (Nastrom and others, 1986:6722).
Finally, modeling also played a major role in estimating turbulence parameters in the late
1970s and 1980s.

2.1.1  VanZandt  and  others.  1978. 

In their research, VanZandt and others compared observations of C_n^2 measured by a
Doppler radar with results from a theoretical model. Their paper tested the hypothesis
that fluctuations in refractive index measured by radar were close to values of
optical turbulence calculated by their model. The model they developed calculated C_n^2
from rawinsonde profiles of wind, temperature and humidity. The radar observations were
made by a Doppler radar located about 15 kilometers west of Boulder,
Colorado (VanZandt and others, 1978:819).


First,  VanZandt  and  others  described  their  radar  measured  optical  turbulence  values. 
Measurements  were  taken  every  50  seconds  at  heights  ranging  from  5  to  15  kilometers. 
Since the C_n^2 observations varied rapidly with time, they averaged their 50-second
measurements into 22 to 72-minute averages (VanZandt and others, 1978:822). They then
compared C_n^2 measurements for 8 March 1978, which had light winds, and 19 March
1978, which had a moderate jet stream. Their results indicated that the magnitude of
C_n^2 increased with increasing vertical shear (VanZandt and others, 1978:823).

Next, they discussed the model calculations and compared both sets of data for five
different days. There was a basic agreement between the model and radar observations, so
their calculations strongly supported their hypothesis. Their comparison also indicated
relative maxima in C_n^2 magnitude in regions of maximum shear above and below the peak
of the jet stream in both the radar and model values (VanZandt and others, 1978:827).

The good agreement between radar observations and model calculations indicated that
both the radar reflectivity and theoretical model were good estimators of the vertical
profile of optical turbulence. Therefore, since radars have both excellent temporal and
spatial resolution, radar measurements will be extremely useful for studying time and space
variations of C_n^2 (VanZandt and others, 1978:827).

2.1.2 Warnock and VanZandt. 1985.

Warnock and VanZandt described and provided computer code for a modified version
of the statistical model described in 2.1.1. This version of the model estimated average
values of C_n^2 for layers of atmosphere about 150 meters thick from National Weather
Service balloon data or a comparable data set. The use of a statistical model was
appropriate because they believed that the random fluctuations of refractive index and
other related physical characteristics were best described by statistics. They further
supported their use of a statistical model by stating that direct calculation of C_n^2 required
precise measurements of the meteorological values and their gradients with a spatial
resolution of approximately one meter throughout the entire atmosphere. Since this data
was not available, they relied on measurable large-scale quantities and statistics (Warnock
and VanZandt, 1985:2).

They also explained that fine-scale turbulent flows are embedded in the large-scale
laminar flow in thin, stratified layers. They suggested these local turbulent flows were
caused by shear instability in areas where the large-scale flow is stable, but where the
fine-scale flow is unstable (Warnock and VanZandt, 1985:3). They also described thinner
layers, several tens of meters or thinner, occurring more often, with many in the
troposphere and stratosphere at any time (Warnock and VanZandt, 1985:4).

2.1.3 Warnock and others. 1985.

Warnock and others compared 12 winter and 22 summer profiles of radar-measured
and model-calculated values of C_n^2 from the revised model just described in 2.1.2. For
each season, large variations in C_n^2 were encountered, sometimes near two orders of
magnitude or greater at one specific height.

Warnock and others found the model's calculations agreed with the radar data. More
importantly, the model's calculations provided insight into what weather conditions caused
the C_n^2 fluctuations. During the winter period, they discovered that large increases in wind
shear accompanying jet-stream passages increased the intensity of turbulence and thus
produced large increases in C_n^2. During the summer, without strong convection, C_n^2
values above 10 kilometers were similar to winter values; however, below 10 kilometers
the summer values were greater than winter values even under light wind conditions.
Warnock and others showed that these greater C_n^2 values were due to the humidity and
humidity gradients contributing to the refractive index and were not caused by increased
turbulence intensity (Warnock and others, 1985:158).

2.1.4  Nastrom  and  others.  1986. 

Nastrom and others studied the variability of C_n^2 at altitudes of 4 to 20 kilometers
measured  by  the  Doppler  radars  at  Poker  Flat,  Alaska,  and  Platteville,  Colorado  (Nastrom 
and  others,  1986:6722).  The  data  from  the  Poker  Flat  radar  was  taken  from  September 
1981  to  September  1982.  Observations  from  the  Platteville  radar  were  taken  from  April 
1983  to  February  1984  (Nastrom  and  others,  1986:6724).  They  investigated  variations  of 
C_n^2 over 30 day periods by examining 3-hour medians. Medians were used instead of
means  because  they  were  less  sensitive  to  extreme  values  due  to  interference  such  as 
reflections  from  airplanes  (Nastrom  and  others,  1986:6725). 

They found that C_n^2 followed a lognormal distribution at all heights they sampled. This
enabled them to statistically analyze the logarithm of C_n^2 assuming a normal distribution.

They  first  calculated  the  autocorrelation  function  for  all  9-hour  intervals  where  at  least 
85%  of  the  observations  remained  after  data  editing  was  completed.  The  individual 
functions  were  then  averaged  and  examined  (Nastrom  and  others,  1986:6727).  They 
determined  that  the  autocorrelation  function  could  be  modeled  as  the  sum  of  a  random 
process  and  first  order  autoregressive  process  (Nastrom  and  others,  1986:6733).  Since 
the autocorrelation function also estimates the effective time between independent values
of C_n^2, they found the effective time decreased with height from 25 to 45 minutes in the
troposphere to approximately 18 minutes in the stratosphere. In other words, in the
stratosphere, a C_n^2 value is no longer related to the value measured 18 minutes after it
(the autocorrelation function is described further in 4.2.1). They also noted that the
logarithm of C_n^2, abbreviated as log (C_n^2), had the largest mean value when the
autocorrelation value was the largest (Nastrom and others, 1986:6728).
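
The notion of an autocorrelation function, and of a lag beyond which values are effectively unrelated, can be illustrated with a short Python sketch. The 1/e threshold used here to pick a decorrelation lag is an assumption chosen for illustration; it is not necessarily the definition of effective time used by Nastrom and others, and both function names are hypothetical.

```python
import math


def autocorrelation(series, max_lag):
    """Sample autocorrelation of a series for lags 0..max_lag."""
    n = len(series)
    mean = sum(series) / n
    dev = [x - mean for x in series]
    var = sum(d * d for d in dev)
    return [sum(dev[t] * dev[t + lag] for t in range(n - lag)) / var
            for lag in range(max_lag + 1)]


def first_lag_below(acf, threshold=1.0 / math.e):
    """Return the first lag whose autocorrelation falls below the threshold.

    The 1/e threshold is an illustrative convention, not taken from the source.
    Returns None if no lag in acf falls below the threshold.
    """
    for lag, value in enumerate(acf):
        if value < threshold:
            return lag
    return None
```

Multiplying the returned lag by the sampling interval of the series gives a rough decorrelation time in the same units as the interval.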

Second, they computed monthly averages of the 3-hour medians, or monthly means of
the medians, of log (C_n^2) (Nastrom and others, 1986:6730). The data showed largest
monthly  means  in  the  winter  with  a  secondary  maximum  in  the  summer.  The  winter 
maximum  is  related  to  enhanced  baroclinic  activity  such  as  wind  speed,  wind  shear  and 
gravity  waves.  The  summer  maximum  is  related  to  increased  convection  (Nastrom  and 
others,  1986:6733). 

Finally, they examined the relationship between C_n^2 and other variables using
correlation coefficients. The coefficients indicated that log (C_n^2) at every level was most
closely correlated with wind speed near the tropopause. They also found that conditions
near the surface seemed to affect C_n^2. Nastrom and others stated that in specific cases, C_n^2
relied on the synoptic weather picture and not just the wind flow and static stability where
the turbulence was measured. Furthermore, a strong correlation between C_n^2, gravity
waves and background wind was discovered (Nastrom and others, 1986:6731).
2.1.5 Tsuda and others. 1988.


Tsuda  and  others  discussed  echoes  observed  in  the  troposphere  and  lower  stratosphere 
in  September  1986  by  the  middle  and  upper  (MU)  radar  site  in  northern  Japan.  They  also 
touched  upon  mechanisms  that  determine  refractive  index  gradients  in  the  atmosphere 
above  10  kilometers  or  what  they  referred  to  as  the  dry  atmosphere.  Finally,  they 
discussed  effects  of  gravity  waves  on  layers  in  the  dry  atmosphere  (Tsuda  and  others, 
1988:655). 

Data  was  collected  at  vertical  heights  ranging  from  about  5  kilometers  to  21  kilometers 
with a vertical resolution of 150 meters and temporal resolution of 74 seconds. The data
was  then  averaged  over  five  records  to  condense  the  data  set.  After  examining  the  data, 
they  found  a  thin,  concentrated,  echoing  layer  between  9-12  kilometers  and  moderately 
stratified  reflection  layers  with  upward  and  downward  phase  progressions  in  the 
stratosphere  (Tsuda  and  others,  1988:656).  They  suggested  this  intense  layer  between  9 
and  12  kilometers  corresponded  to  a  region  of  large  vertical  shear,  characterizing  large 
temperature  gradients  in  both  the  vertical  and  horizontal  directions.  Weaker  layers 
observed  in  the  troposphere  were  also  related  to  areas  of  strong  wind  shear  (Tsuda  and 
others,  1988:659).  Finally,  they  suggested  gravity  waves  affected  the  reflection  layers, 
causing  temperature  fluctuations  in  the  stratosphere  (Tsuda  and  others,  1988:656-658). 

2.2  Recent  Work 

Recent  studies  of  optical  turbulence  have  shifted  from  wind  profiler  radars  to  radars 
with  better  resolution.  In  the  past  few  years,  scientists  have  taken  advantage  of  very  high 
frequency (VHF) Doppler radars that provide excellent spatial and temporal resolution for
measuring C_n^2. They also provide the capability to obtain C_n^2 data sets with observations
taken on a continuous basis for extended periods of time.

2.2.1  Frisch  and  others.  1990. 

Frisch  and  others  calculated  monthly  mean  backscattered  power  for  five  years  using 
observations from the 50-megahertz (MHz) wind profiler in Fleming, Colorado. Their
purpose in this study was to better understand variations in the backscattered signals and
C_n^2, since backscattered power is directly proportional to C_n^2. They suggested that
understanding  these  variations  would  provide  further  insight  into  atmospheric  turbulent 
processes  (Frisch  and  others,  1990:645-646). 

Frisch  and  others  averaged  the  backscattered  power  into  2-kilometer  intervals  and  then 
over  a  thirty-day  period  (Frisch  and  others,  1990:647).  After  reviewing  the  data,  they 
discovered changes of 2 decibels (dB) or larger in the year-to-year variation of C_n^2 (Frisch
and others, 1990:651). They also found a seasonal cycle in C_n^2 below the tropopause in
addition to the year-to-year change. In contrast, they found no seasonal variation above
the tropopause. They also studied a record of backscattered power from the Stapleton
airport wind profiler in Denver, Colorado, and found comparable longer-term trends
(Frisch  and  others,  1990:645). 

2.2.2  Nastrom  and  Eaton.  January  1993. 

Nastrom and Eaton studied the winds and turbulence using observations from the
50-megahertz (MHz) VHF Doppler radar at White Sands Missile Range (WSMR), New
Mexico. They demonstrated that under specific conditions, C_n^2 values in the lower
stratosphere vary by up to an order of magnitude depending on the source of gravity wave
activity in the lower atmosphere (Nastrom and Eaton, 1993b:81).

Observations  were  obtained  for  10  days  in  March  and  April  1991,  providing  a 
continuous  data  set.  Inherent  in  the  observations  was  a  variety  of  synoptic  weather 
patterns such as surface cold fronts, surface troughs and ridge axes. Data was available in
3-minute  increments  at  150-meter  resolution  for  altitudes  from  about  3  to  20  kilometers 
(Nastrom  and  Eaton,  1993b:82). 

Nastrom and Eaton compared mean values of the logarithm of C_n^2 to high and low wind
speeds at 5.6 kilometers (Nastrom and Eaton, 1993b:84). The wind speed at 5.6
kilometers was used as an indicator of synoptic weather activity (Nastrom and Eaton,
1993b:86). They showed that mean values of the logarithm of C_n^2 were greater at all
heights during periods of strong winds than during periods of weak winds. The increased
values of C_n^2 in the lower troposphere were probably due to enhanced moisture present
during passage of fronts and troughs that were connected dynamically with the strong
winds at 5.6 kilometers. Larger values of C_n^2 at higher altitudes were probably due to
increases in turbulence (Nastrom and Eaton, 1993b:84).

Nastrom  and  Eaton  hypothesized  that  enhanced  turbulence  in  the  stratosphere  during 
periods  of  high  winds  at  5.6  kilometers  was  due  to  the  breaking  of  upward-propagating 
gravity waves that were generated by conditions in the lower troposphere related to
troughs (Nastrom and Eaton, 1993b:86). They concluded that future models of C_n^2
should account for the intensity of gravity-wave source mechanisms in addition to
variables related to the background flow such as mean wind and temperature (Nastrom
and Eaton, 1993b:87).

2.2.3  Nastrom  and  Eaton.  1995. 

Nastrom  and  Eaton  (1995)  described  the  average  vertical  profiles  of  winds  and 
turbulence  and  their  seasonal  and  diurnal  changes  using  data  measured  by  WSMR  radar 
(Nastrom  and  Eaton,  1995:2135).  The  general  time  period  for  observations  taken  for 
their  study  was  from  January  1991  to  April  1994  (Nastrom  and  Eaton,  1995:2137). 

First, Nastrom and Eaton found the measurements of the logarithm of C_n^2 to be
normally distributed (Nastrom and Eaton, 1995:2141). Second, their data showed that the
largest values of C_n^2 in the troposphere were found in the summer and were related to
areas of high humidity. In spring, fall and winter and in the stratosphere, there was little
variation in C_n^2. This probably occurred because the mean wind shear was fairly constant
with season, and the mean static stability in the stratosphere changed little with season
(Nastrom and Eaton, 1995:2143). Finally, they noticed relatively small diurnal variations
in the logarithm of C_n^2 (Nastrom and Eaton, 1995:2145). The largest diurnal variation
they did find, however, was in the troposphere during summer and had a range of about 5
decibels (Nastrom and Eaton, 1995:2147).

2.3  Results  Expected 

Based  on  the  literature  summarized  above,  it  seemed  reasonable  to  average  data 
temporally  and  spatially  for  this  study’s  requirements,  and  to  define  the  relationship 
between C_n^2 and weather variables statistically. In addition, a strong statistical relationship
between the logarithm of C_n^2 and the following variables was expected:

(1)  Near-tropopause  wind 

(2)  Ageostrophic  wind 

(3) Bulk Richardson number (which is inversely proportional to the square of the wind
shear assuming constant static stability)
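
The dependence of the bulk Richardson number on wind shear can be made concrete with a minimal Python sketch. It uses a common textbook form, Ri_B = (g / theta_mean) * d_theta * dz / (du^2 + dv^2); whether this study uses exactly this form is an assumption here, and the function and argument names are hypothetical.

```python
import math

G = 9.8  # acceleration due to gravity, m s^-2


def bulk_richardson(theta_lower, theta_upper, u_lower, u_upper,
                    v_lower, v_upper, dz):
    """Bulk Richardson number for a layer of thickness dz (m).

    theta_* are potential temperatures (K); u_*, v_* are wind components (m/s).
    Uses a common textbook form, assumed here for illustration.
    """
    d_theta = theta_upper - theta_lower
    theta_mean = 0.5 * (theta_lower + theta_upper)
    shear_sq = (u_upper - u_lower) ** 2 + (v_upper - v_lower) ** 2
    if shear_sq == 0.0:
        return math.inf  # no shear across the layer
    return (G / theta_mean) * d_theta * dz / shear_sq
```

With static stability held fixed, doubling the wind difference across the layer reduces the result by a factor of four, so strongly sheared layers near a jet stream yield small values.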
3. Methodology


3.1  Theory 

The  index  of  refraction  of  the  atmosphere  for  electromagnetic  waves  depends  on  the 
temperature and humidity of the air (Tatarski, 1961:1). The potential refractivity, derived
below, can be used to calculate the size of fluctuations in the refractive index caused by
temperature and humidity changes (Tatarski, 1961:58). The structure function, which
characterizes the intensity of refractive index fluctuations, and its associated structure
parameter are also given (Tatarski, 1961:10). Lastly, the relationship between the
refractivity  structure  parameter,  received  power  and  radar  reflectivity  is  provided. 

3.1.1 Index of Refraction.

The index of refraction n for radio waves is a function of temperature T (K), pressure p
(mb), and vapor pressure e (mb) (Tatarski, 1961:55). This relationship can be shown as:


$$n - 1 = 10^{-6}\,\frac{79}{T}\left(p + \frac{4800\,e}{T}\right) \tag{1}$$


Any rapid change in the pressure, temperature or humidity can cause rapid changes in the
refractive index. Since rapid changes in pressure over very small distances are highly
unlikely, fluctuations in the index of refraction are mostly caused by large temperature and
humidity irregularities in the air (Rinehart, 1991:148).
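
As a numerical illustration, the refractivity relation can be evaluated with a short Python sketch. It assumes equation (1) has the form (n - 1) x 10^6 = (79 / T)(p + 4800 e / T), with T in kelvin and p and e in millibars; the function name and the sample values in the usage note are hypothetical.

```python
def refractivity(temperature_k, pressure_mb, vapor_pressure_mb):
    """Return (n - 1) * 1e6 for radio waves, following the assumed form of (1)."""
    dry_term = pressure_mb
    moist_term = 4800.0 * vapor_pressure_mb / temperature_k
    return (79.0 / temperature_k) * (dry_term + moist_term)
```

For a dry sample at 220 K and 200 mb the moist term vanishes and the result depends only on p / T. Near the surface, even a vapor pressure of 10 mb adds noticeably to the total, which is consistent with humidity dominating C_n^2 in the lower troposphere.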

Some  properties  of  the  atmosphere  can  be  regarded  as  conservative  quantities  or 
quantities  that  remain  constant  during  certain  transformations  (Tatarski,  1961:40; 
Wallace and Hobbs, 1977:69). The variables T and e in (1) are not conservative;
therefore, they may affect the dynamics of the turbulence (Tatarski, 1961:40,55). For
example,  when  small  parcels  of  air  are  moved  vertically,  their  pressure  undergoes  a 
continuous  equalization  process  with  the  ambient  pressure  at  each  altitude. 

These pressure changes create temperature changes which satisfy the following
equation:

$$\frac{dT}{T} = \frac{\gamma - 1}{\gamma}\,\frac{dp}{p} \tag{2}$$

where

γ = c_p/c_v
c_p = specific heat at constant pressure (J K^-1 kg^-1)
c_v = specific heat at constant volume (J K^-1 kg^-1).

The quantity dp in (2) is related to the change in height dz (m) by the hydrostatic
approximation:

$$dp = -\rho g\,dz \tag{3}$$

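where ρ is the density of the air (kg/m^3) and g is the acceleration due to gravity (9.8
m/sec^2). Using the ideal gas law, one can show that:

$$\frac{dT}{T} = -\frac{\gamma - 1}{\gamma}\,\frac{\rho g}{p}\,dz = -\frac{\gamma - 1}{\gamma}\,\frac{g}{R_d T}\,dz \tag{4}$$

$$\frac{dT}{dz} = -\frac{\gamma - 1}{\gamma}\,\frac{g}{R_d} = -\frac{g}{c_p} \tag{5}$$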
where R_d is the ideal gas constant for dry air (287 J K^-1 kg^-1) and γ_a = g/c_p is the dry
adiabatic lapse rate (K/km). Integrating (5) leaves T + γ_a z = const and when parcels are
displaced vertically, the expression:

$$\theta = T + \gamma_a z \tag{6}$$

gives the potential temperature θ (K) which is a conservative quantity. The vapor
pressure e can also be transformed into a conservative quantity by expressing it in terms of
specific humidity q (Tatarski, 1961:56):

$$e = 1.62\,p\,q \tag{7}$$


Substituting (6) and (7) into (1) gives an expression with conservative quantities of $\theta$
and q (Tatarski, 1961:57):

$$ (n - 1)\times 10^{6} = \frac{79\,p}{\theta - \gamma_a z}\left(1 + \frac{7800\,q}{\theta - \gamma_a z}\right) \tag{8} $$


This form of the refractive index, however, is not a "good passive tracer" (Hocking,
1985:1406). This can be explained by the following situation. Suppose an air parcel is
displaced from level $z_1$ to a new level $z_2$. Suppose at level $z_1$, the parcel was in equilibrium
with the environment and at this level there was a pressure $p_1$, potential temperature $\theta_1$
and specific humidity $q_1$. Suppose at level $z_2$, the ambient pressure is $p_2$ and the
environmental potential temperature and specific humidity are $\theta_2$ and $q_2$, respectively. At
level $z_2$, the potential temperature and specific humidity of the parcel can be represented as
$\theta_1$ and $q_1$, since these are conservative quantities. The pressure of the parcel, however,
has changed to $p_2$ at level $z_2$. The change in refractive index between the parcel and
environment at level $z_2$ can be shown as:

$$ \Delta n = n_{parcel}(p_2, \theta_1, q_1) - n_{env}(p_2, \theta_2, q_2) \tag{9} $$

or

$$ \Delta n = -\left(\frac{\partial n}{\partial \theta}\frac{d\theta}{dz} + \frac{\partial n}{\partial q}\frac{dq}{dz}\right)\Delta z, \tag{10} $$


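where $\Delta z = z_2 - z_1$. Equation (10) is not just the difference in refractive index at levels $z_1$
and $z_2$, which would have been (Hocking, 1985:1407):

$$ \Delta n = \left(\frac{\partial n}{\partial p}\frac{dp}{dz} + \frac{\partial n}{\partial \theta}\frac{d\theta}{dz} + \frac{\partial n}{\partial q}\frac{dq}{dz}\right)\Delta z. \tag{11} $$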


Therefore, fluctuations which arise are not proportional to the total gradient of n, shown
in (11), but rather to (10). This actual change in refractive index is referred to as the
potential refractivity M, shown as (VanZandt and others, 1978:823; Hocking, 1985:1407;
Nastrom and Eaton, 1993a:23236):

$$ M = -77.6\times 10^{-6}\,\frac{p}{T}\,\frac{d\ln\theta}{dz}\,H_1 \tag{12} $$

where

$$ H_1 = 1 + \frac{15500\,q}{T}\left[1 - \frac{1}{2}\,\frac{d\ln q/dz}{d\ln\theta/dz}\right] $$


The term in q dominates in the lower troposphere, but is negligible in the stratosphere where
the average value of $d\ln\theta/dz$ is large (VanZandt and others, 1978:823; Hocking,
1985:1407). Therefore, in the stratosphere where it is relatively dry, $H_1$ is approximately
equal to one (Nastrom and Eaton, 1993a:23236).

3.1.2 Refractivity Turbulence Structure Function and Parameter.

The structure function is a fundamental characteristic of a stationary random process.
Generally speaking, the value of the structure function describes the intensity of the optical
turbulence fluctuations (Tatarski, 1961:10).

The structure function $D_n(r)$ of the atmospheric refractive index for the inertial range
can be shown as (Tatarski, 1961:58):


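$$ D_n(r) = \begin{cases} C_n^2\, r^{2/3} & \text{for } l_0 \ll r \ll L \\ C_n^2\, l_0^{2/3}\left(\dfrac{r}{l_0}\right)^{2} & \text{for } r \ll l_0 \end{cases} \tag{13} $$

where

r = spatial separation (m)
$C_n^2$ = refractivity turbulence structure parameter (m$^{-2/3}$)
$l_0$ = inner scale of turbulence (size of smallest eddies) (m)
L = outer scale of turbulence (size of largest eddies) (m).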

The inertial range is where large eddies break up into smaller eddies, due to fluid
dynamic instabilities, and these smaller eddies break down into even smaller ones. This
cascade of energy from large scales to smaller scales continues until the scale becomes so
minute that viscosity effects are introduced. At this point, the energy flows out of the
cascade into the sink of molecular heat (Dewan, 1980:11). The inertial subrange falls
between the inner scale $l_0$ and outer scale L of turbulence.

The refractivity turbulence structure parameter $C_n^2$ for the inertial subrange can be
defined as:

$$ C_n^2 = a^2\,\alpha'\,L^{4/3}\,M^2 \tag{14} $$

where $a^2$ is a universal constant taken to be 2.8 and $\alpha'$ is a ratio of eddy diffusivities which
is close to 1 (VanZandt and others, 1978:823; Ottersen, 1969:1182). Equation (14) is
valid for homogeneous, isotropic, steady state turbulence. Turbulence is homogeneous
and isotropic when the structure function, $D_n(r)$, does not depend on direction or location
of r. Steady state can be interpreted as the amount of energy that flows into the
turbulence being the same amount of energy dissipated into heat (Dewan, 1980:11).
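
To make (12) and (14) concrete, the following sketch computes M for dry air and the corresponding structure parameter. The outer scale, the stability value, and the function names are assumptions chosen only for illustration, and the ratio of eddy diffusivities is set to exactly 1 because the text describes it as close to 1.

```python
A_SQUARED = 2.8   # universal constant a^2 quoted in the text
ALPHA = 1.0       # ratio of eddy diffusivities, taken as 1 (assumption)


def potential_refractivity_gradient(p_mb, t_k, q, dlntheta_dz, dlnq_dz=0.0):
    """Eq. (12): M in 1/m; q is specific humidity (kg/kg), gradients in 1/m."""
    h1 = 1.0 + (15500.0 * q / t_k) * (1.0 - 0.5 * dlnq_dz / dlntheta_dz)
    return -77.6e-6 * (p_mb / t_k) * dlntheta_dz * h1


def cn2(outer_scale_m, m):
    """Eq. (14): C_n^2 = a^2 * alpha' * L^(4/3) * M^2, in m^(-2/3)."""
    return A_SQUARED * ALPHA * outer_scale_m ** (4.0 / 3.0) * m ** 2


# Hypothetical dry stratospheric layer (q = 0, so H1 = 1).
m_dry = potential_refractivity_gradient(150.0, 215.0, 0.0, 5.0e-5)
print(f"M = {m_dry:.3e} 1/m, Cn2 = {cn2(10.0, m_dry):.3e} m^(-2/3)")
```

Since M enters squared, doubling the potential refractivity gradient quadruples the structure parameter for a fixed outer scale.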

3.1.3 Refractivity Turbulence Structure Parameter and Radar Reflectivity.

A source of clear air return in the atmosphere is from refractive index perturbations.
When the index of refraction changes considerably over scales which are small compared
to the radar wavelength, the area containing these small scale fluctuations can return some
of the incident power back toward the radar. If the radar is sensitive enough, this can
cause a detectable echo to be displayed on the radar (Rinehart, 1991:147). This
relationship between the backscattered power and $C_n^2$ can be examined by first looking at
the radar equation.

The radar equation is given by (Frisch and others, 1990:645):

$$ P_R = \frac{c\,P_t\,G^2\,\theta_B\,\phi_B\,\tau\,\lambda^2\,\eta}{1024\,\pi^2 \ln 2\; R_{an}^2} \tag{15} $$

where

$P_R$ = received power, averaged over several independent realizations (W)
c = speed of light ($3\times10^{8}$ m/sec)
$P_t$ = transmitted power (W)
G = on-axis gain of the antenna
$\theta_B$, $\phi_B$ = half-power beamwidths of the antenna (rad)
$\tau$ = pulse duration (sec)
$\lambda$ = radar wavelength (m)
$\eta$ = radar reflectivity (m$^{-1}$) or radar cross-section per unit volume
$R_{an}$ = range from radar (m).

Equation (15) shows that the received power $P_R$, also referred to as the backscattered
power, is proportional to the radar reflectivity $\eta$.

In clear air, the radar reflectivity can be related to refractive index fluctuations with
spatial scales $\lambda/2$ through the structure function in (13) (Ottersen, 1969:1182; Frisch and
others, 1990:645). If $\lambda/2$ falls within the inertial subrange of turbulence, then the radar
reflectivity is directly proportional to $C_n^2$ as shown (Ottersen, 1969:1183; Hocking,
1985:1411):

$$ \eta = 0.38\,C_n^2\,\lambda^{-1/3} \tag{16} $$

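Substituting (16) into (15) and solving for $C_n^2$ gives the following (Nastrom and others,
1986:6724):

$$ C_n^2 = K\,R_{an}^2\,P_R \tag{17} $$

where

$$ K = \frac{18435}{P_t\,G^2\,\theta_B\,\phi_B\,\tau\,c\,\lambda^{5/3}} $$

Thus, $C_n^2$ is directly proportional to the radar backscattered power as well.
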
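
The constant 18435 can be checked numerically. The sketch below assumes the radar equation carries the factor 1024 pi^2 ln 2 in its denominator and divides it by the 0.38 of (16); the function and argument names are illustrative, not taken from the cited sources.

```python
import math

# Assumed radar-equation factor divided by the 0.38 coefficient of Eq. (16).
K_NUMERATOR = 1024.0 * math.pi ** 2 * math.log(2.0) / 0.38


def cn2_from_power(p_r, r_an, p_t, gain, theta_b, phi_b, tau, lam, c=3.0e8):
    """C_n^2 = K * R_an^2 * P_R, with K built from the radar parameters (SI units)."""
    k = K_NUMERATOR / (p_t * gain ** 2 * theta_b * phi_b * tau * c * lam ** (5.0 / 3.0))
    return k * r_an ** 2 * p_r


print(round(K_NUMERATOR))
```

For fixed radar parameters and range, the function is linear in the received power, which is the proportionality stated in the text.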
3.2  Radar  Used  to  Collect  Data 

The 50-megahertz (MHz) radar used to collect $C_n^2$ values for this study is located at the
Atmospheric Profiler Research Facility (APRF) at White Sands Missile Range, New
Mexico, 32°24'N, 106°21'W (Hines and others, 1996:11; Nastrom and Eaton, 1993b:82).
The APRF complex is positioned on the floor of the Tularosa basin approximately 1220
meters above mean sea level and about 10 kilometers east of the San Andres, San
Augustine and Organ Mountains (Hines and others, 1996:13).

The 50-MHz profiler, manufactured by Tycho Technology, Inc., contains three antenna
beams. Each antenna beam takes about 1 minute to make a measurement, so a full cycle
takes about 3 minutes (Hines and others, 1996:18). One radar beam is pointed toward the
zenith, while the other two radar beams are pointed 15° from the zenith; one is pointed
toward the south and the other is pointed toward the west. The antenna has a diameter of
about 150 meters and generates a one-way beamwidth of 2.9° (Hines and others,
1996:113).

The radar functions at a central frequency of 49.25 MHz and a transmitted peak power
of 250 kilowatts (kW) (Hines and others, 1993:113-114). Received power observed for
one minute along each of three beams is used to calculate a full profile every 3 minutes
at a 150-meter resolution. In normal mode, 112 range gates are used to sample from
about 3 to 20 kilometers above ground level (Hines and others, 1993:113; Nastrom and
Eaton, 1993b:82).

The radar equation used to compute $C_n^2$ from this 50 MHz radar is (Hines and others,
1993:114):

$$ C_n^2 = \frac{112.16\,\lambda^{1/3}\,R_{an}^2\,P_r}{\delta\,P_t\,L_t\,A_{em}} \tag{18} $$

where

$\lambda$ = radar wavelength (m)
$R_{an}$ = range from radar (m)
$P_r$ = received power (W)
$\delta$ = range resolution (m)
$P_t$ = peak power at transmitter (W)
$L_t$ = total receiver and transmission line loss and antenna efficiency
$A_{em}$ = effective antenna area (m$^2$).

Comparisons between annual mean $C_n^2$ measurements made by the south and west
beams of the radar showed differences of about 1 dB in the troposphere and 2 dB in the
stratosphere. It was suggested that this difference may indicate anisotropy in small-scale
turbulence (Nastrom and Eaton, 1995:2144). It was necessary to increase average $C_n^2$
values in the west beam to coincide with the average values in the south beam.

Manufacturer's tests of the 50-MHz wind profiler show that the accuracy of the $C_n^2$
measurements is within ±1.5 dB. The antenna is calibrated to ±0.5 dB, but the
difference between the oblique beams must be included in this uncertainty (Hines and
others, 1996:114). The parameters of the WSMR 50-MHz radar used in this study are
summarized in Table 1.


Table 1. Radar parameters used in this study (Nastrom and Eaton, 1993b:2136).

TRANSMITTER
  Central frequency             49.25 MHz
  Wavelength                    6 m
  Output power                  250 kW peak
  Duty cycle                    5%
  Pulse width                   8 µsec
  Type                          Combination solid state preamplifier and
                                tube cavity amplifier, three stage

ANTENNA
  Number of beams               3
  Physical aperture
  Effective aperture
  Pointing                      Zenith and 15° to south and west
  Type                          Coaxial collinear phased array
  One way beamwidth             2.9°

RECEIVER
  Type                          Low noise superheterodyne
  Bandwidth                     Matched to transmitted pulse
  Receiver noise figure         Less than 1 dB

PERFORMANCE
  Nominal lowest range gate     2 km above ground level
  Range resolution              150 m
  Time resolution               1 min per beam (3 min total)
  Number of range gates         112
  Calibrated C_n^2 range
  Bandwidth                     1 MHz
  Power aperture product        1 x 10^8 W m^2
  Error in C_n^2 measurements   ±1.5 dB

3.3 Data Processing


Both  the  meteorological  and  optical  turbulence  data  had  to  be  processed  in  order  to 
complete  the  multiple  regression  analysis.  The  procedures  are  described  below. 

3.3.1  Meteorological  Data 

The synoptic scale meteorological data was extracted from the NCEP/NCAR reanalysis
database and was provided by the sponsor, AFRL, in a non-gridded binary format. Data
was provided for latitudes between 10° and 65° N and for the entire longitude grid. Since
the radar site is located at 32°24'N, 106°21'W, a smaller grid was extracted from this grid
to coincide with the radar location. This grid encloses the area between 30°-35°N and
102°30'-110°W with the radar site centrally located as shown in Figure 2. The intention
was to have an area large enough to capture portions of the jet stream near or above the
radar site.

The variables used in this data set are shown in Table 2. The levels indicate the number
of pressure levels that particular variable is available for, with a maximum number of 17
pressure levels that range from 1000 to 10 millibars. If the variable is only available at one
vertical level, no level is given. Note that all these variables can be considered related to
fronts, jet streams or both. Therefore, all variables provided by AFRL were used as
potential independent variables in the regression analysis with the exception of surface
geopotential height since it varied little with time or space.

The ageostrophic wind and bulk Richardson number were derived from the other
variables in Table 2. The procedures for these calculations are explained below.

Figure 2. Grid used in this study. The 50 MHz radar is centrally located at WSMR.

3.3.1.1 Ageostrophic Wind.

An initial look at the relationship between the ageostrophic wind and $C_n^2$ by AFRL
showed that a correlation may exist. The sponsor requested further investigation;
therefore, the ageostrophic wind was included as another potential independent variable in
the regression analysis.

The ageostrophic wind $\vec{V}_{ag}$ (where the arrow over V denotes a vector) was calculated
using the difference between the total wind $\vec{V}$ and the geostrophic wind $\vec{V}_g$, or:

$$ \vec{V}_{ag} = \vec{V} - \vec{V}_g \tag{19} $$

The total wind components, U (east-west or zonal wind) and V (north-south or meridional
wind), were given in the NCEP/NCAR data set; however, the geostrophic wind was
unknown. Since the geopotential height H was available, the geostrophic wind could be
computed from the following cross product (Holton, 1992:59):

$$ \vec{V}_g = \frac{1}{f}\,\hat{k} \times \nabla H, \tag{20} $$

where f is the Coriolis parameter (sec$^{-1}$), $\hat{k}$ is the unit vector in the vertical direction, and
$\nabla$ is the gradient in rectangular coordinates.


Table 2. Meteorological variables used in this study.

VARIABLE                             SYMBOL   LEVELS   UNITS
Surface pressure                     PS                Pa
Temperature at tropopause            TT                K
Pressure at tropopause               PT                Pa
U-winds at tropopause                UT                m/sec
V-winds at tropopause                VT                m/sec
Wind shear at tropopause             SHT               1/sec
Surface lifted index                 LI                K
Best (4-layer) index                 B                 K
Temperature at max wind level        TM                K
Pressure at max wind level           PM                Pa
U-winds at max wind level            UM                m/sec
V-winds at max wind level            VM                m/sec
Pressure reduced to mean sea level   PR                Pa
Geopotential height                  H        17
U-winds                              U        17       m/sec
V-winds                              V        17       m/sec
Ageostrophic u-winds                 UAG      17       m/sec
Ageostrophic v-winds                 VAG      17       m/sec
Temperature                          T        17       K
Bulk Richardson number               BRN      16       unitless
Pressure vertical velocity           W        12       Pa/sec
Relative humidity                    RH       8        %

Equation (20), however, did not account for phenomena such as the shrinking area
between latitude lines when moving toward the poles. This was compensated for by
making the gradient and Coriolis parameter a function of latitude. The x and y
components of the gradient, dx and dy, were made functions of latitude by transforming
them into spherical coordinates (Holton, 1992:33):

$$ dx = r_e \cos(\phi)\,d\lambda, \qquad dy = r_e\,d\phi \tag{21} $$

where

$\phi$ = latitude (rad)
$\lambda$ = longitude (rad)
$r_e$ = radius of the earth (m).

The Coriolis parameter was also represented as (Holton, 1992:13,39):

$$ f = 2\Omega \sin(\phi), \tag{22} $$

where $\Omega$ is the angular frequency of the earth, or $7.292 \times 10^{-5}$ rad/sec.

Furthermore, the derivatives in the gradient were estimated using finite difference
approximations, specifically centered differences of order $(\Delta x)^2$ (Haltiner and Williams,
1980:109). The geostrophic wind components were then defined as:

$$ V_{gx} = -\frac{1}{f}\,\frac{H(y + \Delta y) - H(y - \Delta y)}{2\Delta y}, \qquad V_{gy} = \frac{1}{f}\,\frac{H(x + \Delta x) - H(x - \Delta x)}{2\Delta x} \tag{23} $$

where

$V_{gx}$ = x component of the geostrophic wind
$V_{gy}$ = y component of the geostrophic wind
$\Delta y$ = meridional distance between grid points
$\Delta x$ = zonal distance between grid points.

Finally, the difference in Equation (19) was computed and ageostrophic winds were
obtained. Vectors of total wind, geostrophic wind and ageostrophic wind were also
plotted in the Grid Analysis and Display System (GrADS) to ensure accuracy.
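
The procedure above can be sketched for a single grid point as follows. This is a minimal illustration, not the code used in the study: the earth radius value, the function names, and the sample heights are assumptions. The sketch also multiplies the height gradient by g, which assumes H is a geopotential height in meters; pass g=1.0 if H is already a geopotential.

```python
import math

OMEGA = 7.292e-5    # rad/sec, angular frequency of the earth (from the text)
R_EARTH = 6.371e6   # m, mean earth radius (assumed value)


def geostrophic_wind(h_n, h_s, h_e, h_w, lat_deg, dlat_deg, dlon_deg, g=9.8):
    """Centered-difference geostrophic wind (m/sec) at one grid point.

    h_n, h_s, h_e, h_w are the heights (m) at the neighboring grid points
    to the north, south, east, and west; dlat_deg and dlon_deg are the
    grid spacings in degrees.
    """
    lat = math.radians(lat_deg)
    f = 2.0 * OMEGA * math.sin(lat)                          # Coriolis parameter
    dy = R_EARTH * math.radians(dlat_deg)                    # meridional spacing
    dx = R_EARTH * math.cos(lat) * math.radians(dlon_deg)    # zonal spacing
    u_g = -(g / f) * (h_n - h_s) / (2.0 * dy)
    v_g = (g / f) * (h_e - h_w) / (2.0 * dx)
    return u_g, v_g


def ageostrophic_wind(u, v, u_g, v_g):
    """Total wind minus geostrophic wind, component by component."""
    return u - u_g, v - v_g


# Hypothetical heights falling 200 m from south to north on a 2.5 degree grid.
u_g, v_g = geostrophic_wind(11900.0, 12100.0, 12000.0, 12000.0, 32.5, 2.5, 2.5)
print(ageostrophic_wind(50.0, 5.0, u_g, v_g))
```

The zonal spacing shrinks with the cosine of latitude while the meridional spacing does not, which is the latitude dependence the spherical transformation introduces.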

3.3.1.2 Bulk Richardson Number.

Theoretical work has shown that the Richardson number, a ratio between buoyancy and
wind shear, can be used as an indicator of atmospheric turbulence (Stull, 1988:177).
Since turbulent mixing is thought to cause fluctuations in the refractive index, the
Richardson number may be an indicator of optical turbulence as well (Warnock and
VanZandt, 1985:1).

If the Richardson number is less than about 0.25, then the possibility for the onset of
turbulence is high. This work was based only on local observations of the wind shear and
temperature gradients. Local gradients, however, are seldom known, but estimates of the
gradients can be made at discrete height intervals using finite differences. These estimates
are included in a more realistic ratio called the bulk Richardson number.

The bulk Richardson number, BRN, is defined as (Stull, 1988:177):

$$ BRN = \frac{g\,\Delta\overline{\theta_v}\,\Delta z}{\overline{\theta_v}\left[(\Delta U)^2 + (\Delta V)^2\right]} \tag{24} $$

where

g = acceleration due to gravity (m/sec$^2$)
$\overline{\theta_v}$ = mean virtual potential temperature (K)
$\Delta z$ = height interval (m)
$\Delta U$ = difference in mean zonal wind between two layers (m/sec)
$\Delta V$ = difference in mean meridional wind between two layers (m/sec).

The differences in mean winds between two layers were estimated using U and V from the
NCEP/NCAR database at two different pressure levels, or:

$$ \Delta U \approx U(p_2) - U(p_1) $$

$$ \Delta V \approx V(p_2) - V(p_1) $$

where $p_2$ and $p_1$ are the pressures at the top and bottom layers, respectively. The other
variables in (24) were derived from variables in Table 2 as shown below.

The height interval $\Delta z$ was computed using the geopotential height H (List, 1951:218):


where $H_2$ is the geopotential height at the top layer and $H_1$ is the geopotential height at the
bottom layer.

To calculate the mean virtual potential temperature $\overline{\theta_v}$ and the change in mean virtual
potential temperature $\Delta\overline{\theta_v}$, an expression had to be found in terms of known variables.
First, the saturated vapor pressure was defined from the Goff-Gratch formula (Duffield
and Nastrom, 1984:10):

$$ e_s = 100\left(10^{\,a - b - c + d - e}\right), \tag{26} $$

where

$e_s$ = saturated vapor pressure (Pa)
a = 23.832241
b = 5.02808 log T
c = $1.3816(10^{-7})(10^{\,11.344 - 0.0303998\,T})$
d = $8.1328(10^{-3})(10^{\,3.49149 - 1302.8844/T})$
e = 2949.076/T
T = temperature (K).
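
A minimal sketch of this calculation is given below, assuming the exponent combines the five terms as a - b - c + d - e, which is the expanded form of the standard Goff-Gratch expression over water with log taken to base 10. The function name is illustrative.

```python
import math


def saturation_vapor_pressure(t_k):
    """Saturated vapor pressure (Pa) from the Goff-Gratch terms a through e."""
    a = 23.832241
    b = 5.02808 * math.log10(t_k)
    c = 1.3816e-7 * 10.0 ** (11.344 - 0.0303998 * t_k)
    d = 8.1328e-3 * 10.0 ** (3.49149 - 1302.8844 / t_k)
    e = 2949.076 / t_k
    return 100.0 * 10.0 ** (a - b - c + d - e)   # factor 100 converts mb to Pa


for t in (220.0, 273.16, 300.0):
    print(f"T = {t:7.2f} K  e_s = {saturation_vapor_pressure(t):10.3f} Pa")
```

As a sanity check, the expression returns roughly 611 Pa near the freezing point and roughly one standard atmosphere at 373.16 K.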

Once the saturated vapor pressure was calculated, it was substituted into the equation
for saturated mixing ratio $w_{sat}$:

$$ w_{sat} = \varepsilon\,\frac{e_s}{p - e_s}, \tag{27} $$

where $\varepsilon$ is equal to 0.622 and p is the pressure (Pa). Third, the mixing ratio w was found
from (Duffield and Nastrom, 1984:12):

$$ w = \frac{RH}{100}\,w_{sat} \tag{28} $$

where RH is the relative humidity in percent. Fourth, w and $w_{sat}$ were put into the virtual
potential temperature equation. In saturated or cloudy air, the virtual potential
temperature $\theta_v$ was defined as:

$$ \theta_v = \theta\left(1 + 0.61\,w_{sat} - w_L\right), \tag{29} $$

where

$$ \theta = T\left(\frac{p_0}{p}\right)^{0.286} $$

$\theta$ = potential temperature (K)
$p_0$ = reference pressure (100,000 Pa)
$w_L$ = liquid water mixing ratio.

In an unsaturated environment, $\theta_v$ was defined as (Stull, 1988:7):

$$ \theta_v = \theta\,(1 + 0.61\,w). \tag{30} $$
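
The chain from saturated vapor pressure to virtual potential temperature can be sketched as below. The function names are illustrative, the saturated vapor pressure is passed in as an input, and treating a relative humidity of 100% or more as "saturated" is an assumption of this sketch rather than a criterion stated in the text.

```python
EPSILON = 0.622      # ratio used in the saturated mixing ratio equation
P0 = 100000.0        # reference pressure (Pa)


def mixing_ratios(e_s, p, rh_percent):
    """Return (w_sat, w) from saturated vapor pressure, pressure (Pa), and RH (%)."""
    w_sat = EPSILON * e_s / (p - e_s)
    return w_sat, rh_percent / 100.0 * w_sat


def potential_temperature(t_k, p):
    """Potential temperature (K) referenced to P0."""
    return t_k * (P0 / p) ** 0.286


def virtual_potential_temperature(t_k, p, e_s, rh_percent):
    """Unsaturated theta_v (K); None where the air is treated as saturated."""
    if rh_percent >= 100.0:
        return None  # liquid water mixing ratio is unknown, so the value is flagged
    _, w = mixing_ratios(e_s, p, rh_percent)
    return potential_temperature(t_k, p) * (1.0 + 0.61 * w)


# Hypothetical values: 300 K at 1000 mb, e_s = 3500 Pa, 50% relative humidity.
print(virtual_potential_temperature(300.0, 100000.0, 3500.0, 50.0))
```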

The virtual potential temperature calculated for each layer was used as an
approximation for the mean virtual potential temperature since meteorological values were
only available at discrete heights. Similarly, the difference in mean virtual potential
temperature was estimated by the difference in virtual potential temperature between two
different heights. Finally, the BRN in equation (24) was calculated.

Note that an expression for the liquid water mixing ratio in (29) was not defined.
There was insufficient information to calculate the value and furthermore, the concept of
the ABL is to fire upon missiles above clouds or above the region where saturated air is
confined. Therefore, values for the BRN were flagged in regions of saturated air so they
could be omitted in the final data set to be used in the regression analysis. In addition,
BRN values were flagged in regions where the difference in mean wind was zero, which
resulted in an infinite BRN value, so their values could also be deleted from the regression
data set.
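
The calculation and flagging described above can be sketched as follows. The function name is illustrative, flagged values are represented by None, and using the average of the two layer values as the mean virtual potential temperature is an assumption of this sketch.

```python
def bulk_richardson(theta_v_top, theta_v_bot, z_top, z_bot,
                    u_top, u_bot, v_top, v_bot, g=9.8):
    """Bulk Richardson number between two levels, or None when flagged."""
    if theta_v_top is None or theta_v_bot is None:
        return None                      # saturated air: theta_v undefined
    du = u_top - u_bot
    dv = v_top - v_bot
    shear_sq = du ** 2 + dv ** 2
    if shear_sq == 0.0:
        return None                      # zero wind difference: infinite BRN
    theta_mean = 0.5 * (theta_v_top + theta_v_bot)   # assumed layer mean
    return g * (theta_v_top - theta_v_bot) * (z_top - z_bot) / (theta_mean * shear_sq)


# Hypothetical layer: 10 K increase over 1000 m with a 10 m/sec zonal wind difference.
print(bulk_richardson(330.0, 320.0, 11000.0, 10000.0, 30.0, 20.0, 5.0, 5.0))
```

Returning None instead of infinity or an error keeps the flagged cases easy to drop before the regression.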
3.3.2 Optical Turbulence Data.

One of the factors needed to compare time series of data in regression analysis is that
the independent and dependent variables be measured simultaneously. Since the $C_n^2$ and
meteorological values were not measured simultaneously, a temporal adjustment was
made to the $C_n^2$ data. In addition, spatial changes were made to the $C_n^2$ data as described
below.

As shown in Table 1, a $C_n^2$ measurement is made every 3 minutes while the
meteorological data was measured in 6-hour increments. Rather than trying to interpolate
values for all the variables listed in Table 2, the $C_n^2$ values were averaged in 6-hour
increments to coincide with the weather data. First, a quality control check was
conducted on the measured $C_n^2$ values by the sponsor using a program developed by
NCAR (Weber and Wuertz, Oct 91). Then, averages were calculated for 06Z, 12Z, 18Z
and 00Z using data available three hours before and three hours after each time as
summarized in Table 3. Since many of the $C_n^2$ measurements were missing or did not pass
the quality control check, consideration was given to the number of observations available
for each 6-hour time average. Therefore, averages were only calculated where a majority
of the observations, at least 70%, was available.


Table 3. Times and hours used to calculate optical turbulence averages.

TIME    HOURS USED IN AVERAGE
06Z     03Z-09Z
12Z     09Z-15Z
18Z     15Z-21Z
00Z     21Z-03Z

Spatial averages were also computed for the $C_n^2$ data, after the time averages were
completed, for brevity in the data set. Samples of the data were vertically averaged at
different increments and studied to determine an appropriate range. This range could
smooth some small-scale features while still capturing large-scale features. An increment
of 300 meters appeared to keep the most interesting features in the data. In addition,
vertical averages were only computed between altitudes of about 10,000 to 15,000
meters. This range was chosen so the tropopause level would be included, and therefore,
the jet stream would be included since it is most often confined to a region near the
tropopause (Nastrom and Eaton, 1995:2137). The vertical levels used for optical
turbulence data averages are shown in Table 4.


Table 4. Vertical levels used for optical turbulence data averages.

Level   Altitudes Averaged (m)
  1     10,122-10,272
  2     10,422-10,572
  3     10,722-10,872
  4     11,022-11,172
  5     11,322-11,472
  6     11,622-11,772
  7     11,922-12,072
  8     12,222-12,372
  9     12,522-12,672
 10     12,822-12,972
 11     13,122-13,272
 12     13,422-13,572
 13     13,722-13,872
 14     14,022-14,172
 15     14,322-14,472
 16     14,622-14,772
 17     14,922-15,072
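
The temporal and vertical averaging described in this section can be sketched as below. This is an illustration under stated assumptions, not the processing code used in the study: the function names, the half-open six-hour window, the count of 120 expected observations per window (one every 3 minutes), and the sample series are all hypothetical. The sketch simply averages whatever values it is given, and missing or rejected observations are represented by None.

```python
from datetime import datetime, timedelta


def six_hour_average(samples, center, expected=120, min_fraction=0.70):
    """Average the good values within 3 hours of `center`, or None if too few."""
    start = center - timedelta(hours=3)
    end = center + timedelta(hours=3)
    good = [v for t, v in samples if start <= t < end and v is not None]
    if len(good) < min_fraction * expected:
        return None          # fewer than 70% of the observations available
    return sum(good) / len(good)


def vertical_levels(first_gate_m=10122, n_levels=17, gate_spacing_m=150):
    """Altitude pairs (m) averaged for each level, every second 150 m gate pair."""
    step = 2 * gate_spacing_m
    return [(first_gate_m + step * i, first_gate_m + step * i + gate_spacing_m)
            for i in range(n_levels)]


# Hypothetical 3-minute series around 06Z with the first 30 samples missing.
t0 = datetime(1993, 1, 20, 3, 0)
series = [(t0 + timedelta(minutes=3 * i), None if i < 30 else -17.0)
          for i in range(120)]
print(six_hour_average(series, datetime(1993, 1, 20, 6, 0)))
print(vertical_levels()[0], vertical_levels()[-1])
```

With the default arguments, `vertical_levels` reproduces the altitude pairs of Table 4, from 10,122-10,272 m up to 14,922-15,072 m.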



4.  Data  Description  and  Analysis 


4.1  Data  Description 

The  meteorological  and  optical  turbulence  data  sets  used  in  this  study  were  taken  from 
different  archives.  They  are  described  below. 

4.1.1 NCEP/NCAR Reanalysis Data Set.

The basic concept of the NCEP/NCAR reanalysis project was to use a fixed state-of-the-art
analysis/forecast system and perform data assimilation using past data from 1957 to the
present (reanalysis). This analysis/forecast system would then perform data assimilation
into the future. This would enable climatologists to determine whether current climate
irregularities are significant when compared to an extended reanalysis without changes in
the data assimilation system (Kalnay and others, 1996:438). In addition, this 40-year
reanalysis provides a quality data set appropriate for many studies such as this one.

The reanalysis archive was designed to be comprehensive, allowing research in budget
studies, and easily accessible to users in need of extended periods of data. Since it was
not possible to meet both design requirements in one archive, several archival formats are
available (Kalnay and others, 1996:448). The meteorological data used in this research
was taken from the main synoptic archive in a gridded binary format.

The main synoptic archive is the most comprehensive archive of the reanalysis. It
includes analysis and first-guess fields at 00Z, 06Z, 12Z and 18Z on a 2.5° latitude and
longitude grid (Kalnay and others, 1996:449). The reanalysis gridded fields are also
categorized into classes, based on the relative influence of observations and models on the
gridded variable. Variables extracted from the synoptic archive and their class are
summarized in Table 5. An A denotes that the variable is strongly influenced by
observations; therefore, it is the most dependable class. The letter B denotes that both
models and observations have influence on the variable (Kalnay and others, 1996:448).


Table 5. Classification of synoptic archive variables used in this study.

VARIABLE                             SYMBOL   CLASS
Surface pressure                     PS       B
Temperature at tropopause            TT       A
Pressure at tropopause               PT       A
U-winds at tropopause                UT       A
V-winds at tropopause                VT       A
Wind shear at tropopause             SHT      A
Surface lifted index                 LI       B
Best (4-layer) index                 B        B
Temperature at max wind level        TM       A
Pressure at max wind level           PM       A
U-winds at max wind level            UM       A
V-winds at max wind level            VM       A
Pressure reduced to mean sea level   PR       A
Geopotential height                  H        A
U-winds                              U        A
V-winds                              V        A
Temperature                          T        A
Pressure vertical velocity           W        B
Relative humidity                    RH       B

As mentioned in 3.3.1, the meteorological data provided by AFRL included latitudes
between 10° and 65° N and the entire longitude grid. A smaller grid between 30°-35°N
and 102°30'-110°W was extracted from this grid to coincide with the radar site. Data was
provided in six hour increments beginning at 00Z for all months in 1991-1996 and for
January-June 1997. There were no missing database values; however, values of the BRN
calculated from the database were defined as missing for the reasons discussed in 3.3.1.2.

4.1.2  Radar  Data. 

The measurements made by the WSMR 50-MHz radar were obtained and evaluated
through a quality control (QC) program conducted by the sponsor. The QC program was
designed by NCAR (Weber and Wuertz, Oct 91). The logarithm of $C_n^2$ was provided for
heights (above mean sea level) between 3,222 meters and 19,722 meters in 150 meter
increments. The data was temporally and vertically averaged between about 10,000 and
15,000 meters as described in 3.3.2.

The data was available for January-April 1993, July-September 1993 and January 1994.
Within these dates, there were several missing data values or values that did not pass the
QC check. Estimated dates for missing values or values that failed the QC check at
various pressure levels are shown in Table 6.


Table 6. Estimated days where radar data was missing or failed QC check.


Jan  93 

Feb  93 

Mar  93 

Jul93 

Jan  94 

Days 

1-15 

16-20 

13-30 

1-8 

1-8 

29-30 

4-5 

1-3 

27-29 

12-15 

10-20 

18-19 

22-26 
4.2 Data Analysis


Data analysis involved some autocorrelation of the reanalysis database and radar
measured optical turbulence values. Autocorrelation was not as pertinent to the
regression analysis as originally thought at the beginning of this study, as explained below.
Analysis also included checking that the variables were normally distributed and studying
scatterplots of the optical turbulence data versus the reanalysis variables. The focus of the
data analysis, however, was the multiple linear regression described in detail below.

4.2.1  Autocorrelation 

Autocorrelation can be defined as the correlation of a variable with itself. In the case of
temporal autocorrelation, correlations are run on the past and future values of a variable.
These are also known as lagged correlations. They can be pictured by comparing two
copies of a sequence of values with one of the sequences shifted by a unit of time. A lag-1
correlation, shifted one unit of time, is usually calculated to measure persistence; however,
autocorrelations with lags greater than one can be useful also (Wilks, 1995:52). They can
be used to help determine when a variable becomes independent or is no longer correlated
with its past value. They can also be used to determine seasonality in a variable.

Autocorrelations are computed using the Pearson correlation equation. For a lag k, the
autocorrelation coefficient can be computed from (Wilks, 1995:53):

$$ r_k = \frac{\sum_{i=1}^{n-k}\left(x_i - \bar{x}_-\right)\left(x_{i+k} - \bar{x}_+\right)}{\left[\sum_{i=1}^{n-k}\left(x_i - \bar{x}_-\right)^2\right]^{1/2}\left[\sum_{i=k+1}^{n}\left(x_i - \bar{x}_+\right)^2\right]^{1/2}} \tag{31} $$

where $\bar{x}_-$ and $\bar{x}_+$ denote the sample means from the first and last (n - k) data values,
respectively. This equation is only valid for $0 \le k < n - 1$.
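
A minimal sketch of the lag-k calculation is shown below. It correlates the first (n - k) values with the last (n - k) values, each about its own sample mean; the function name and the sample series are illustrative, and this is not the SAS procedure used in the study.

```python
import math


def lag_autocorrelation(x, k):
    """Pearson correlation between the first and last (n - k) values of x."""
    n = len(x)
    if not 0 <= k < n - 1:
        raise ValueError("lag k must satisfy 0 <= k < n - 1")
    first = x[:n - k]
    last = x[k:]
    mean_first = sum(first) / len(first)
    mean_last = sum(last) / len(last)
    num = sum((a - mean_first) * (b - mean_last) for a, b in zip(first, last))
    den = math.sqrt(sum((a - mean_first) ** 2 for a in first)
                    * sum((b - mean_last) ** 2 for b in last))
    return num / den


# Hypothetical series: a steady trend persists, an alternating series does not.
print(lag_autocorrelation([1.0, 2.0, 3.0, 4.0, 5.0], 1))
print(lag_autocorrelation([1.0, -1.0, 1.0, -1.0, 1.0, -1.0], 1))
```

Evaluating the function for k = 0, 1, 2, ... and plotting the results against k gives the autocorrelation function. A constant series makes the denominator zero, so such input would need separate handling.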

The collection of autocorrelations for different lags is known as the autocorrelation
function. This function is usually displayed graphically with the autocorrelation plotted as
a function of lag k. Autocorrelations for the data sets were computed with the SAS
statistics package. An example of the autocorrelation function is shown in Figure 3 for the
meridional wind at the tropopause, VT, at 30° N 107°30'W using 1993 data. Each lag k
is a 6 hour increment. It shows a gradual decay toward zero as the lag k increases, which
is expected for most meteorological variables. If the autocorrelation function did not
decay toward zero after a short period, then accurate long term forecasts would be easy;
long term forecasts would simply be a slight modification of the observation made several
days prior (Wilks, 1995:54). Figure 3 also shows that the autocorrelation of VT is near
zero or that VT is independent around lag 19, which is approximately 5 days.


Figure  3.  Autocorrelation  function  for  VT  at  30°  N  107°30'W. 


The autocorrelation function was used in this study because the original thought was
that the variables used in the regression could not be serially correlated. However, this is
not the case. The variables can be strongly autocorrelated; it is the errors that are
assumed to be random and independent (refer to section 4.2.4.1). Therefore, the
residuals, which are estimates of the errors, should not be highly autocorrelated (Wilks, 1995:174). In
addition, there were several gaps and missing values in the final data sets used for the
regression, so it did not make sense to perform autocorrelations on them. The final data
sets were no longer a true time series of data due to missing months and days in the radar
data and undefined bulk Richardson numbers. As a result, the autocorrelation functions
produced were not pertinent to the multiple regression analysis and were not used further
in this study.

4.2.2  Normality  Check. 

Since regression analysis usually assumes that its variables are normally distributed, this
distribution was checked first. Note that this is not a requirement to complete a regression
analysis. The Statistix and SAS computer packages used the Wilk-Shapiro measure to
check for normality in both the optical turbulence and meteorological data. The Wilk-Shapiro
value ranges from 0 to 1, with 1 representing a perfectly normal distribution. The
corresponding Wilk-Shapiro (rankit) plot is a straight line when the Wilk-Shapiro
value is 1. An example of the Wilk-Shapiro plot is shown in Figure 4 for the
logarithm of $C_n^2$ averaged around 10,500 meters. The Wilk-Shapiro value
shown is 0.9204. In most cases, if the Wilk-Shapiro value is near 0.9, a normal
distribution can be assumed.
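
The idea behind the rankit plot can be sketched as follows. The sketch assumes that an approximate Wilk-Shapiro value can be taken as the squared correlation between the ordered data and their rankits (the Shapiro-Francia form), with rankits approximated by Blom's formula. Whether Statistix or SAS compute the statistic in exactly this way is an assumption, and the sample values are hypothetical.

```python
import math
from statistics import NormalDist, mean

def approx_wilk_shapiro(values):
    """Squared correlation between the ordered data and approximate rankits."""
    n = len(values)
    ordered = sorted(values)
    # Blom's approximation to the expected normal order statistics (rankits)
    rankits = [NormalDist().inv_cdf((i - 0.375) / (n + 0.25)) for i in range(1, n + 1)]
    mean_r, mean_v = mean(rankits), mean(ordered)
    s_rv = sum((r - mean_r) * (v - mean_v) for r, v in zip(rankits, ordered))
    s_rr = sum((r - mean_r) ** 2 for r in rankits)
    s_vv = sum((v - mean_v) ** 2 for v in ordered)
    return s_rv ** 2 / (s_rr * s_vv)

n = 50
# Hypothetical sample lying exactly on normal quantiles (mean -16, std. dev. 0.5)
normal_like = [NormalDist(mu=-16.0, sigma=0.5).inv_cdf((i - 0.375) / (n + 0.25))
               for i in range(1, n + 1)]
# Hypothetical strongly skewed sample
skewed = [math.exp(i / 5) for i in range(n)]

w_normal = approx_wilk_shapiro(normal_like)   # points fall on a straight rankit line
w_skewed = approx_wilk_shapiro(skewed)        # curved rankit plot, lower value
```

A value near 1 corresponds to a nearly straight rankit plot; the skewed sample gives a clearly lower value.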

Wilk-Shapiro values for the logarithm of $C_n^2$ for heights between about 10,000 and
15,000 meters mostly fell near 0.9. All values were greater than 0.8, but some were closer
to 0.8 than to 0.9. For the purposes of this study, they were assumed to have a normal
distribution.

A large sample of the meteorological variables was checked for normality at each grid
point. The majority of the variables passed the normality check. There were a few
variables which produced Wilk-Shapiro values near 0.8, such as relative humidity at
different levels, but they were still assumed to have a normal distribution.


Figure 4. Wilk-Shapiro plot for log ($C_n^2$) around 10,500 m.


4.2.3  Graphical  Analysis. 

Scatter plots of the logarithm of $C_n^2$ versus the meteorological data were studied. This
type of graphical analysis was used to gain initial insight into the linear relationship
between the logarithm of $C_n^2$ and the reanalysis data.

Scatter plots for the logarithm of $C_n^2$ averaged around 10,500 meters versus various
weather variables from January 15 to January 30 of 1993 are shown in Figure 5, Figure 6
and Figure 7. The approximated regression line is also displayed in the figures. A linear
relationship is distinguishable in the scatter plot of the logarithm of $C_n^2$ versus zonal wind
at the max-wind level (UM) depicted in Figure 6. A linear relationship is not as apparent,
however, in Figure 5 and Figure 7. After studying several scatter plots for different grid
points, it was determined that there was potential for a linear relationship between the
synoptic scale weather variables and the logarithm of $C_n^2$.


Figure 5. Scatter plot of log ($C_n^2$) vs. pressure at the max-wind level (PM).


4.2.4  Regression  Analysis. 

Regression  analysis  is  a  powerful  statistical  tool  used  to  assess  the  relationship  between 
one  dependent  variable  and  one  or  more  independent  variables.  It  is  also  useful  because 
the  independent  variables  can  be  somewhat  correlated  with  one  another  (Tabachnick  and 
Fidell,  1983:86). 

Regression  analysis  is  most  easily  explained  by  the  case  of  simple  linear  regression,  a 
special  case  of  multiple  linear  regression  (Wilks,  1995:160).  This  includes  describing  the 
analysis of variance, goodness-of-fit measures and the analysis of residuals. Regression
analysis can then be easily expanded to the general case of multiple linear regression. If
there  are  a  large  number  of  predictor  variables,  as  in  this  study,  then  variable  selection  for 
the  multiple  linear  regression  can  be  automated  using  a  stepwise  technique. 

4.2.4.1 Simple Linear Regression.

The  concept  of  simple  linear  regression  is  to  capture  the  relationship  between  two 
variables,  given  as  x  and  y,  in  a  single  straight  line.  By  convention,  x  is  used  for  the 
independent  or  predictor  variable  and  y  is  used  for  the  dependent  variable  or  predictand. 
The regression operation selects the line which produces the least error for predictions of
y, given observations of x (Tabachnick and Fidell, 1983:87; Wilks, 1995:160; Devore,
1995:475). Least error usually means that the sum of squared errors is minimized.

Figure 8 depicts the situation. Given a data set of (x, y) pairs, the problem lies in
determining the straight line which minimizes the squared vertical distances between the line and the
data points. The equation for the line can be expressed as:

$$\hat{y} = a + bx,$$  (32)

where $\hat{y}$ is the predicted value of y, a is the least-squares intercept and b is the slope of
the line. The vertical distances between the data points and line, called residuals or
estimates of the error, are defined as:

$$e_i = y_i - \hat{y}(x_i),$$  (33)

where there is a different residual $e_i$ for each data pair $(x_i, y_i)$ (Wilks, 1995:160; Devore,
1995:485). It is assumed that the errors are independent, random variables with zero
mean and constant variance. In addition, it is often assumed that the errors follow a
normal distribution (Wilks, 1995:163).


Figure 8. Schematic illustration of simple linear regression (Wilks, 1995:161).

Combining equations (32) and (33) gives the regression equation:

$$y_i = \hat{y}_i + e_i = a + b x_i + e_i,$$  (34)

which shows that the actual value of the predictand is the sum of the predicted value and
residual (Tabachnick and Fidell, 1983:87; Wilks, 1995:161; Devore, 1995:477). In order
to minimize the sum of squared errors, the least-squares intercept a and slope b in (34)
must satisfy the following analytic expressions (Wilks, 1995:162):

$$b = \frac{n \sum_{i=1}^{n} x_i y_i - \sum_{i=1}^{n} x_i \sum_{i=1}^{n} y_i}{n \sum_{i=1}^{n} x_i^2 - \left(\sum_{i=1}^{n} x_i\right)^2},$$  (35)

$$a = \frac{1}{n}\left(\sum_{i=1}^{n} y_i - b \sum_{i=1}^{n} x_i\right),$$  (36)

where n is the number of data pairs.
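
The closed-form least-squares expressions for the slope and intercept (Wilks, 1995:162) can be sketched in Python as follows. The data pairs are hypothetical and were chosen only so that the arithmetic is easy to follow; the function name is likewise illustrative.

```python
def fit_line(x, y):
    """Least-squares intercept a and slope b for y = a + b*x."""
    n = len(x)
    sum_x, sum_y = sum(x), sum(y)
    sum_xy = sum(xi * yi for xi, yi in zip(x, y))
    sum_xx = sum(xi * xi for xi in x)
    b = (n * sum_xy - sum_x * sum_y) / (n * sum_xx - sum_x ** 2)
    a = (sum_y - b * sum_x) / n
    return a, b

# Hypothetical predictor (for example, a wind speed) and predictand values
x = [10.0, 20.0, 30.0, 40.0, 50.0]
y = [-16.9, -16.6, -16.2, -16.1, -15.7]

a, b = fit_line(x, y)

# Residuals: observed value minus predicted value for each data pair
residuals = [yi - (a + b * xi) for xi, yi in zip(x, y)]
```

For these values the slope is 0.029 and the intercept is -17.17, and the residuals sum to zero, as they must for a least-squares line with an intercept.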

4.2.4.2  Analysis  of  Variance. 

In 4.2.4.1, the assumption of zero mean and constant variance in the errors was
presented. This constant variance can be estimated using the residuals in:

$$s_e^2 = \frac{1}{n-2} \sum_{i=1}^{n} e_i^2,$$  (37)

where $\sum e_i^2$ is the sum of squared residuals (Wilks, 1995:163).

Rather than calculating the sample variance from (37), it is more common to use a
form based on the relationship:

$$SST = SSR + SSE,$$  (38)

where SST is the total sum of squares, SSR is the regression sum of squares and SSE is the
sum of squared errors. SST can be defined as the sum of squared deviations of y around
the mean of y or (Wilks, 1995:164):

$$SST = \sum_{i=1}^{n} (y_i - \bar{y})^2,$$  (39)

where $\bar{y}$ is the mean of y. This is proportional to the estimated variance of y and,
therefore, measures the overall variability of the predictand. The SSR term is the sum of
squared differences between the regression predictions and the estimated mean of y or:

$$SSR = \sum_{i=1}^{n} [\hat{y}(x_i) - \bar{y}]^2.$$  (40)

This can be related to the regression equation in (34) by:

$$SSR = b^2 \sum_{i=1}^{n} (x_i - \bar{x})^2,$$  (41)

where $\bar{x}$ denotes the mean of x. Finally, SSE is defined as the sum of squared differences
between the residuals and their mean, which is zero:

$$SSE = \sum_{i=1}^{n} e_i^2.$$  (42)

The estimated error variance can now be shown as:

$$s_e^2 = \frac{1}{n-2}(SST - SSR) = \frac{1}{n-2} SSE.$$  (43)
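
The decomposition of the total sum of squares can be checked numerically. The sketch below is illustrative only; it uses hypothetical data pairs and computes SST, SSR and SSE directly from their definitions so that the relationship SST = SSR + SSE can be confirmed.

```python
def sums_of_squares(x, y):
    """Return SST, SSR, SSE and the estimated error variance for a simple linear fit."""
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    b = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / s_xx
    a = y_bar - b * x_bar
    sst = sum((yi - y_bar) ** 2 for yi in y)                         # total
    ssr = b ** 2 * s_xx                                              # regression
    sse = sum((yi - (a + b * xi)) ** 2 for xi, yi in zip(x, y))      # error
    s2_e = sse / (n - 2)                                             # error variance
    return sst, ssr, sse, s2_e

# Hypothetical data pairs
x = [10.0, 20.0, 30.0, 40.0, 50.0]
y = [-16.9, -16.6, -16.2, -16.1, -15.7]

sst, ssr, sse, s2_e = sums_of_squares(x, y)
```

With these values SST is 0.86, SSR is 0.841 and SSE is 0.019, so the error variance estimate is 0.019/3.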

The output from many statistical computer packages includes the analysis of variance
(ANOVA) parameters just described, in an ANOVA table. A generic form of the
ANOVA table is shown in Table 7. It includes the source of the variance in column 1, the
degrees of freedom (DF) in column 2, the sum of squares (SS) values in column 3 and the
mean squared (MS) values in column 4. The three rows comprise the total, regression
and error values for each column. Note that error and regression values in the DF and SS
columns will sum to the corresponding value in the total row (Wilks, 1995:165). The MS
values are given by the corresponding values of SS/DF. For simple linear regression, there
is one regression degree of freedom, so SSR is equal to the mean square of regression,
MSR. Also, it was just shown that $s_e^2$ = SSE/DF, so $s_e^2$ is also equal to the mean squared
error, MSE. The other value in the MS column, the F ratio, is a measure of the fit of a
regression. This will be discussed further in 4.2.4.3.


Table  7.  ANOVA  Table  for  Simple  Linear  Regression  (Wilks,  1995:166). 


SOURCE       DF       SS     MS
Total        n - 1    SST
Regression   1        SSR    MSR = SSR/1  (F = MSR/MSE)
Error        n - 2    SSE    MSE = $s_e^2$

4.2.4.3  Goodness-of-Fit  Measures. 

Several measures can be used to determine the fit of a regression, or how well the
regression line portrays the scatter plot of the data. Three commonly used measures of
regression fit are the mean squared error, MSE, the coefficient of determination or $R^2$, and
the F ratio.

The MSE is probably the most basic of the three measures since it shows the variability
of the observed y values around the regression line. Since MSE = $s_e^2$, this shows how the
residuals will fall around the regression line. If MSE is small, the data points will be in close
proximity to the regression line and the regression will most likely provide a good fit as shown in Figure
9 (Wilks, 1995:166). If MSE is large, they will be spread further from the regression line
and the regression will be more likely to be unsatisfactory as illustrated in Figure 10.


Figure 9. Small MSE indicating a fairly good regression relationship (Wilks, 1995:167).

Figure 10. Large MSE showing a poor regression relationship (Wilks, 1995:167).


The second measure of the strength of a regression is the coefficient of determination or
$R^2$. This can be calculated from:

$$R^2 = \frac{SSR}{SST} = 1 - \frac{SSE}{SST}.$$  (44)

$R^2$ is understood to be the proportion of the variability of the predictand that is accounted
for or explained by the regression (Devore, 1995:489). For a perfect regression, SSR =
SST and SSE = 0; therefore $R^2$ = 1. For a worthless regression, SSR = 0 and SSE = SST;
therefore $R^2$ = 0. Figure 10 can also exemplify this poor case of regression as an $R^2$ value
close to zero (Wilks, 1995:167).

The third measure of fit of a regression is the F ratio. As shown in Table 7, F is equal
to the ratio of MSR to MSE; therefore, F increases with the strength of the regression,
since a strong relationship between x and y generates a large MSR and small MSE (Wilks,
1995:168). It is based upon the F distribution which has two parameters. The first
parameter is the number of numerator degrees of freedom and the second is the number of
denominator degrees of freedom. A typical F density curve is shown in Figure 11 with a
corresponding upper tail critical value, Fc. The area to the right of the critical value under
the curve represents the significance level, α, of the test, usually assumed to be 5% or 0.05.
In the SAS statistics package, the F ratio tests the null hypothesis that all
parameters other than the intercept are zero. The alternate hypothesis is that at least one of
the parameters is not zero. If the F ratio is less than Fc or, equivalently, if the probability value (p-value)
of F is greater than α, then there is not enough evidence to reject the null hypothesis.
In that case, there is no evidence that the predictors contribute to the regression.


Figure 11. An example of the F distribution curve (Devore, 1995:397).

4.2.4.4 Analysis of Residuals.

Some of the results of statistics software packages can be misleading if the underlying
assumptions related to errors are not satisfied. As stated in 4.2.4.1, errors were assumed
to be independent, random variables with zero mean and constant variance, also called
homoscedasticity. In addition, the errors were assumed to have a normal distribution.
These assumptions can be checked using estimators of error or residuals.

An easy and basic check on residuals can be made by examining a scatter plot of the
residuals, e, against the predicted values, $\hat{y}$ (Tabachnick and Fidell, 1983:93; Wilks,
1995:171). If the residuals give the appearance of a horizontal band around their zero
mean  as  illustrated  in  Figure  12,  then  the  assumption  of  homoscedasticity  is  supported 
(Wilks,  1995:172). 

The  Wilk-Shapiro  measure  may  be  used  to  determine  whether  the  residuals  follow  a 
normal  distribution.  If  the  Wilk-Shapiro  value  is  near  0.9  with  a  near  linear  plot,  a  normal 
distribution  of  the  residuals  can  be  assumed. 

Finally, the independence of residuals can be checked if the original data was temporally
correlated. A plot of the residuals as a function of time can be examined. If groups of
positive and negative residuals tend to bunch together, rather than occurring more
randomly, then serial correlation can be suspected. The Durbin-Watson test statistic may
also be used. This test examines the null hypothesis that the residuals are serially
independent against the alternate hypothesis that they are consistent with a first-order
autoregressive process. The Durbin-Watson test statistic, d, can be expressed as:

$$d = \frac{\sum_{i=2}^{n} (e_i - e_{i-1})^2}{\sum_{i=1}^{n} e_i^2}.$$  (45)

This statistic basically sums the squared differences between pairs of successive
residuals and divides the sum by a scaling factor. If the residuals are positively correlated, d
will be relatively small (Wilks, 1995:173). If the residuals are randomly distributed in
time, then d will be relatively large. As mentioned in 4.2.1, even though a regression uses
strongly autocorrelated variables for both predictand and predictors, the residuals will not
necessarily be strongly autocorrelated themselves (Wilks, 1995:174).
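
A minimal sketch of the Durbin-Watson calculation in Equation (45) is given below. The two residual sequences are hypothetical and were constructed only to contrast persistent residuals with rapidly alternating ones.

```python
def durbin_watson(residuals):
    """Durbin-Watson statistic: squared successive differences over squared residuals."""
    num = sum((residuals[i] - residuals[i - 1]) ** 2 for i in range(1, len(residuals)))
    den = sum(e ** 2 for e in residuals)
    return num / den

# Hypothetical residuals that drift slowly from positive to negative (positive serial correlation)
persistent = [1.0, 0.9, 0.8, 0.6, 0.3, -0.2, -0.5, -0.8, -0.9, -1.0]
# Hypothetical residuals that change sign at every step (negative serial correlation)
alternating = [1.0, -1.0, 1.0, -1.0, 1.0, -1.0, 1.0, -1.0, 1.0, -1.0]

d_persistent = durbin_watson(persistent)
d_alternating = durbin_watson(alternating)
```

The persistent sequence gives a small d (about 0.11) and the alternating sequence gives d = 3.6. Since d is approximately 2(1 - r), where r is the lag-1 autocorrelation of the residuals, values near 2 are consistent with serially independent residuals.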


Figure  12.  Scatter  plot  of  residuals  vs.  predicted  values  (Wilks,  1995:172). 

4.2.4.5 Multiple Linear Regression.

Multiple linear regression is the more general case of linear regression. As in
simple linear regression, there is a single predictand, y, but there is more than one predictor
variable, x. The previous discussion of simple linear regression will generalize to the
situation of multiple linear regression.

Let K be the number of predictor variables. Simple linear regression was just the
special case where K = 1. The prediction equation for multiple linear regression can then
be given as:

$$\hat{y} = b_0 + b_1 x_1 + b_2 x_2 + \cdots + b_K x_K,$$  (46)

where $b_0$ is the intercept and $b_k$ is the coefficient, comparable to the slope, for each predictor
variable $x_k$ (Wilks, 1995:176; Devore, 1995:550).

Equation (33) for the residuals still applies; however, the fact that the predicted value
$\hat{y}$ is a function of a vector of predictors, $x_k$, must be taken into account. If there are two
predictor variables, K = 2, then the residual can still be pictured as a vertical distance, now from
the regression surface rather than from a line. For more than two predictors, the geometric
situation is similar, but difficult to picture.

The (K + 1) parameters are found similarly to simple linear regression by minimizing the
sum of squared errors. This is done most easily by solving simultaneous equations using
matrix algebra or by using statistical software packages. The results can also be
summarized in an ANOVA table for multiple regression as shown in Table 8. Notice that
the calculations are analogous to those in simple linear regression with the only difference
being the degrees of freedom.
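
As a sketch of the matrix-algebra route, the (K + 1) parameters can be obtained by forming and solving the normal equations. The code below is illustrative and is not the SAS procedure used in this study; the function names are hypothetical, and the data are generated exactly from y = 1 + 2*x1 - 3*x2 so that the expected coefficients are known.

```python
def solve_linear_system(A, rhs):
    """Solve A z = rhs by Gaussian elimination with partial pivoting (A nonsingular)."""
    m = len(A)
    aug = [row[:] + [r] for row, r in zip(A, rhs)]
    for col in range(m):
        pivot = max(range(col, m), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, m):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, m + 1):
                aug[r][c] -= factor * aug[col][c]
    z = [0.0] * m
    for r in range(m - 1, -1, -1):
        z[r] = (aug[r][m] - sum(aug[r][c] * z[c] for c in range(r + 1, m))) / aug[r][r]
    return z

def fit_multiple(X, y):
    """Return [b0, b1, ..., bK] minimizing the sum of squared errors."""
    rows = [[1.0] + list(xi) for xi in X]      # leading 1 carries the intercept b0
    p = len(rows[0])
    # Normal equations: (X'X) b = X'y
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(p)]
    return solve_linear_system(xtx, xty)

# Hypothetical data with K = 2 predictors and no noise
X = [(0, 0), (1, 0), (0, 1), (1, 1), (2, 1), (1, 2)]
y = [1 + 2 * x1 - 3 * x2 for x1, x2 in X]

b = fit_multiple(X, y)
error_df = len(y) - 2 - 1      # n - K - 1 error degrees of freedom
```

Because the data contain no noise, the fitted coefficients reproduce 1, 2 and -3 to within rounding error, and the error degrees of freedom are 6 - 2 - 1 = 3.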


Table 8. ANOVA Table for Multiple Linear Regression (Wilks, 1995:177).


SOURCE       DF           SS     MS
Total        n - 1        SST
Regression   K            SSR    MSR = SSR/K  (F = MSR/MSE)
Error        n - K - 1    SSE    MSE = SSE/(n - K - 1) = $s_e^2$

4.2.4.6 Stepwise Regression.


Multiple regression analysis allows a seemingly endless number of potential predictor
variables. An initial list can be multiplied significantly by using mathematical
transformations such as the square of predictors or products of predictors, called
interaction terms (Wilks, 1995:177; Devore, 1995:553). In this study, there are 151
potential predictors (shown in Table 2) with approximately 2.85 x 10^45 (2^151) possible
combinations of models. With such a large number of predictors, explicit examination of all possible
subsets is impossible, especially within the three months in which this study was conducted. There
are, however, alternative methods that can be used to identify good regression models
(Devore, 1995:571). The method chosen for this study was stepwise regression.
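
The size of the model space follows from the fact that each of the 151 predictors is either included in or excluded from a model. The short calculation below confirms the count; the fitting rate used to put the number in perspective is purely hypothetical.

```python
n_predictors = 151                  # potential predictors in this study
n_models = 2 ** n_predictors        # each predictor is either in or out of a model

print(f"{n_models:.2e}")            # count of possible subsets in scientific notation

# Hypothetical rate of one billion model fits per second, for scale only
models_per_second = 1e9
seconds_per_year = 365.25 * 24 * 3600
years_needed = n_models / models_per_second / seconds_per_year
```

Even at this assumed rate, an exhaustive search would take on the order of 10^28 years.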

Stepwise  regression  uses  a  combination  of  a  forward  selection  and  backward 
elimination  method.  The  procedure  starts  by  adding  predictor  variables  to  the  model  one 
by  one.  After  each  addition,  the  statistics  package  examines  those  variables  previously 
entered  to  determine  if  they  are  still  statistically  significant  to  the  model.  If  they  are  no 
longer significant, they are eliminated from the model (Devore, 1995:576). In the SAS
statistics  package,  predictor  variables  are  examined  for  addition  or  deletion  based  on  an  F 
statistic  at  the  0.15  significance  level. 
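
The add-then-re-test logic can be sketched as follows. This is a simplified illustration, not the SAS implementation: it uses fixed partial F thresholds (`f_enter` and `f_remove`, both hypothetical values) in place of the 0.15 significance level, because converting an F statistic to a p-value requires the F distribution. The function names and the synthetic data are also assumptions.

```python
import random

def solve(A, rhs):
    """Gaussian elimination with partial pivoting (assumes A is nonsingular)."""
    m = len(A)
    aug = [row[:] + [r] for row, r in zip(A, rhs)]
    for col in range(m):
        pivot = max(range(col, m), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, m):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, m + 1):
                aug[r][c] -= factor * aug[col][c]
    z = [0.0] * m
    for r in range(m - 1, -1, -1):
        z[r] = (aug[r][m] - sum(aug[r][c] * z[c] for c in range(r + 1, m))) / aug[r][r]
    return z

def sse(columns, y):
    """Sum of squared errors for y regressed on an intercept plus the given columns."""
    rows = [[1.0] + [col[i] for col in columns] for i in range(len(y))]
    p = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(p)]
    b = solve(xtx, xty)
    return sum((yi - sum(bj * rj for bj, rj in zip(b, r))) ** 2 for r, yi in zip(rows, y))

def stepwise(predictors, y, f_enter=4.0, f_remove=3.9, max_steps=50):
    """Return sorted indices of the predictors kept by a simple stepwise search."""
    n, selected = len(y), []
    for _ in range(max_steps):
        current = sse([predictors[j] for j in selected], y)
        # Forward step: find the unused predictor with the largest partial F
        best_j, best_f = None, f_enter
        for j in range(len(predictors)):
            if j in selected:
                continue
            trial = sse([predictors[i] for i in selected + [j]], y)
            f = (current - trial) / (trial / (n - len(selected) - 2))
            if f > best_f:
                best_j, best_f = j, f
        if best_j is None:
            break                               # nothing else qualifies for entry
        selected.append(best_j)
        # Backward step: re-test the variables entered earlier
        for j in selected[:-1]:
            full = sse([predictors[i] for i in selected], y)
            reduced = sse([predictors[i] for i in selected if i != j], y)
            f = (reduced - full) / (full / (n - len(selected) - 1))
            if f < f_remove:
                selected.remove(j)
    return sorted(selected)

# Synthetic example: y depends on x0 and x2 only; x1 is unrelated noise
random.seed(0)
n = 80
x0 = [random.gauss(0, 1) for _ in range(n)]
x1 = [random.gauss(0, 1) for _ in range(n)]
x2 = [random.gauss(0, 1) for _ in range(n)]
y = [2.0 * a - 3.0 * c + random.gauss(0, 0.5) for a, c in zip(x0, x2)]

chosen = stepwise([x0, x1, x2], y)
```

In this synthetic case the two predictors that actually generate y are retained. Whether the unrelated predictor also enters depends on chance and on the thresholds chosen.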

Stepwise regression was chosen for many reasons. First, parsimony, where less is more
in terms of predictors, was a major goal for each model (Cody and Smith, 1991:220).
Stepwise regression would ensure that each variable provided a significant contribution to
the model rather than just trying to obtain the largest coefficient of determination with
insignificant predictors. Since the coefficient of determination increases with the number
of predictor variables, maximizing it alone would most likely produce an overfit model (Devore,
1995:572). Stepwise regression would also ensure that insignificant predictors were
omitted, unlike either the forward selection or backward elimination procedure. Second,
even with the large number of potential predictors, computer processing time would not be a
problem. A procedure to find the largest coefficient of determination was available;
however, it finds $R^2$ values for all possible combinations of the model. As mentioned
previously, this was not an option due to the large number of potential predictors. Finally,
there is a smaller chance of multicollinearity, or high correlation between predictors which
can give misleading results, when using stepwise regression.


5. Findings and Conclusions


5.1  Results 

Results of the stepwise regression procedure for latitudes and longitudes between
30°-35°N, 102°30'-110°W indicate a weak linear relationship between the logarithm of $C_n^2$ at
various altitudes and different synoptic scale meteorological variables. This was
confirmed by first validating results through analysis of residuals and then by examining
the $R^2$ values, the MSE terms and the F ratios.

Analysis  of  residuals  was  conducted  by  first  examining  scatter  plots  of  residuals  versus 
predicted values, to ensure the assumption of homoscedasticity in the errors was met.
Analysis  of  residuals  then  included  testing  the  assumption  of  normality  using  the  Wilk- 
Shapiro  normality  test.  The  assumptions  of  homoscedasticity  and  normality  in  the  errors 
appeared  to  be  supported.  The  Durbin-Watson  test  for  serial  correlation  of  the  data  was 
not  conducted  because  the  data  used  in  the  regression  analysis  was  not  a  true  time  series. 
Since  testing  variables  for  independence  with  several  gaps  in  the  data  did  not  make  sense, 
it  was  assumed  that  the  errors  were  independent  and  random. 

The $R^2$ values for all the regression models are summarized in Table 9. They are
arranged in latitude and longitude coordinates versus the averaged vertical levels given in Table
4. All $R^2$ values are statistically significant at the 0.05 significance level. The values range
from 0.130 (35°N, 102°30'W, level 16) to 0.682 (32°30'N, 107°30'W, level 9), so even the
largest is relatively distant from the $R^2$ value of 1 that indicates a perfect regression. The
majority of the values are near 0.5, indicating a weak relationship. Further examination
of Table 9 shows that the highest $R^2$ values, where the strongest linear relationship can be
assumed, fall between levels 7 through 12 or altitudes of about 12,000 to 13,500 meters.
The three highest $R^2$ values are located between levels 9 and 11 (altitudes of
approximately 12,600 to 13,200 meters) at 32°30'N, 107°30'W, which is the grid point
located closest to the radar site at 32°24'N, 106°21'W. The lowest $R^2$ values, indicating the
weakest relationships, are between levels 16 and 17 or altitudes of about 14,600 to 15,000
meters, respectively.


Table 9. $R^2$ values for latitude/longitude coordinates vs. averaged vertical level.


Lat (°N)   30     32.5   35     30     32.5   35     30     32.5   35     30     32.5   35
Lon (°W)   102.5  102.5  102.5  105    105    105    107.5  107.5  107.5  110    110    110
LEVEL
1          .487   .492   .503   .586   .524   .541   .594   .596   .451   .608   .567   .503
2          .522   .554   .557   .556   .602   .526   .536   .609   .583   .481   .617   .539
3          .494   .527   .511   .524   .489   .540   .535   .534   .539   .529   .542   .559
4          .493   .524   .506   .481   .597   .466   .509   .520   .522   .556   .566   .484
5          .562   .480   .477   .578   .581   .434   .610   .564   .532   .577   .606   .557
6          .408   .504   .374   .587   .562   .430   .591   .584   .502   .508   .609   .596
7          .427   .542   .429   .432   .619   .522   .589   .606   .526   .520   .30    .565
8          .499   .581   .528   .565   .671   .493   .623   .551   .565   .508   .583   .542
9          .477   .451   .577   .578   .596   .548   .632   .682   .623   .637   .580   .594
10         .457   .465   .505   .514   .613   .559   .580   .677   .584   .612   .574   .550
11         .450   .432   .415   .435   .571   .506   .602   .668   .623   .615   .635   .609
12         .404   .413   .444   .505   .605   .463   .578   .555   .514   .552   .527   .608
13         .497   .266   .394   .436   .450   .454   .535   .525   .238   .500   .477   .306
14         .346   .196   .209   .442   .489   .334   .548   .517   .436   .419   .476   .394
15         .324   .286   .292   .333   .441   .249   .394   .526   .306   .507   .468   .413
16         .361   .201   .130   .396   .353   .219   .367   .376   .313   .250   .257   .351
17         .319   .198   .305   .364   .277   .269   .306   .351   .238   .315   .346   .357

The MSE values shown in Table 10 range from 0.104 (30°N, 105°W, level 17) to 0.324
(30°N, 102°30'W, level 7). The MSE terms appear to be relatively small; however, it is
difficult to visualize the MSE using the distance from the residuals to the regression
surface with so many predictors, so the F ratio was used to test for the linear relationship
that may exist between log ($C_n^2$) and the weather variables.


Table 10. MSE values for latitude/longitude coordinates vs. averaged vertical level.


Lat (°N)   30     32.5   35     30     32.5   35     30     32.5   35     30     32.5   35
Lon (°W)   102.5  102.5  102.5  105    105    105    107.5  107.5  107.5  110    110    110
LEVEL
1          .265   .263   .260   .224   .250   .244   .230   .223   .269   .225   .242   .262
2          .255   .243   .255   .244   .219   .256   .253   .224   .221   .282   .222   .262
3          .259   .253   .271   .241   .269   .247   .257   .251   .242   .252   .252   .246
4          .244   .246   .258   .256   .210   .271   .258   .238   .239   .235   .235   .262
5          .203   .247   .246   .206   .197   .258   .196   .209   .203   .202   .200   .213
6          .291   .256   .320   .216   .234   .274   .226   .226   .238   .256   .215   .218
7          .324   .262   .317   .314   .226   .262   .247   .223   .254   .273   .218   .240
8          .275   .236   .255   .229   .196   .264   .216   .238   .240   .260   .229   .259
9          .243   .256   .206   .193   .195   .213   .179   .160   .183   .178   .204   .200
10         .223   .221   .201   .194   .172   .177   .178   .139   .174   .160   .176   .189
11         .235   .244   .251   .230   .201   .209   .184   .151   .169   .177   .168   .184
12         .231   .222   .215   .192   .169   .197   .175   .191   .183   .176   .187   .170
13         .202   .263   .229   .212   .209   .204   .187   .195   .280   .191   .195   .262
14         .240   .283   .286   .209   .203   .250   .185   .199   .219   .221   .206   .233
15         .218   .236   .229   .191   .191   .215   .212   .188   .204   .173   .188   .204
16         .135   .162   .186   .139   .128   .159   .142   .143   .136   .155   .154   .137
17         .117   .134   .112   .104   .123   .114   .120   .113   .137   .114   .109   .118

The F test shown below was conducted on the regression model with the largest $R^2$
value of 0.682. This model was for the grid point at 32°30'N, 107°30'W at an altitude of
about 12,600 meters. The ANOVA table for this regression model, which includes the F
statistic and its corresponding probability value (p-value), is given in Table 11.


Table 11. ANOVA table for 32°30'N, 107°30'W, 12,600 m.


SOURCE       DF     SS        MS
Total        263    115.541
Regression   34     78.803    2.318  (F = 14.45, p-value = 0)
Error        229    36.738    0.160

The assumptions of homoscedasticity and normality in the errors were checked first to
ensure that the results from the SAS statistics package were valid for this grid point. The
scatter plot of residuals versus predicted values of log ($C_n^2$), plotted using SAS, is shown
in Figure 13. There appears to be an equal amount of variance above and below zero, which is
the expected value of the error. The general shape of a horizontal band around the zero
line can be seen; therefore, the assumption of homoscedasticity is supported. The Wilk-Shapiro
plot of the residuals is shown in Figure 14. The plot shows a strong linear fit with
a Wilk-Shapiro value of 0.9970; therefore, the assumption of normality is strongly
supported. The model results from the SAS package were then deemed valid and the F
test was conducted.

Figure 13. Scatter plot of residuals vs. predicted values (32°30'N, 107°30'W, 12,600 m).


The null hypothesis for this F test is that all parameters except for the intercept are
zero. The alternate hypothesis is that at least one of these parameters is not zero. The
situation is illustrated in Figure 15. First, an F distribution curve was plotted using the
regression degrees of freedom, 34, as d1 and the error degrees of freedom, 229, as d2.
Assuming a significance level α of 0.05, a critical value of F, denoted Fc, was also
computed. Since F = 14.45 is larger than Fc = 1.481 (equivalently, the corresponding
p-value, reported as 0, is less than α = 0.05), the null hypothesis was rejected. Therefore,
at least one of the independent variables contributes to the regression, that is, to
predicting values of log(Cn²).
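
The arithmetic behind this test can be reproduced with a minimal sketch using only the Python standard library. It assumes that 2.318 in the ANOVA table is the regression mean square and 36.738 the error sum of squares; the function names are illustrative, and the critical value is found by numerically integrating the F density rather than by the method the author used.

```python
import math


def f_pdf(x, d1, d2):
    """F density with (d1, d2) degrees of freedom; assumes d1 > 2 so pdf(0) = 0."""
    if x <= 0.0:
        return 0.0
    log_beta = (math.lgamma(d1 / 2) + math.lgamma(d2 / 2)
                - math.lgamma((d1 + d2) / 2))
    log_pdf = (0.5 * (d1 * math.log(d1 * x) + d2 * math.log(d2)
                      - (d1 + d2) * math.log(d1 * x + d2))
               - math.log(x) - log_beta)
    return math.exp(log_pdf)


def f_cdf(x, d1, d2, n=2000):
    """P(F <= x) by composite Simpson integration over [0, x] (n is even)."""
    if x <= 0.0:
        return 0.0
    h = x / n
    total = f_pdf(0.0, d1, d2) + f_pdf(x, d1, d2)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f_pdf(i * h, d1, d2)
    return total * h / 3


def f_critical(alpha, d1, d2):
    """Upper-tail critical value found by bisection on the CDF."""
    lo, hi = 0.0, 50.0
    for _ in range(40):
        mid = (lo + hi) / 2
        if f_cdf(mid, d1, d2) < 1 - alpha:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


d1, d2 = 34, 229                  # regression and error degrees of freedom
mse = 36.738 / d2                 # error mean square, about 0.160
f_ratio = 2.318 / mse             # regression mean square / error mean square
f_c = f_critical(0.05, d1, d2)
reject_null = f_ratio > f_c
print(f"F = {f_ratio:.2f}, Fc = {f_c:.3f}, reject null: {reject_null}")
```

Under these assumptions the F ratio comes out near the reported 14.45 and the critical value near the reported 1.481, so the null hypothesis is rejected.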


[Plot: Wilk-Shapiro / rankit plot of residuals; horizontal axis: rankits; approximate
Wilk-Shapiro statistic 0.9970.]

Figure 14. Wilk-Shapiro plot of residuals (32°30'N, 107°30'W, 12,600 m).

F ratios and their corresponding p-values for all regression models are summarized in
Table 12. Shorthand notation was used for very small p-values, with E representing the
power of 10 (for example, 3E-16 = 3 × 10⁻¹⁶). All F ratios are statistically significant
assuming a 0.05 significance level. In other words, the meteorological variables contribute
to the regression or to predicting values of log(Cn²) for all models.
Figure 15. F distribution curve for 32°30'N, 107°30'W, 12,600 m.


The number of predictors in each model, summarized in Table 13, varied from 7 to
42 variables. Most models with a large number of predictors showed a relatively high
R² value, while models with a small number of predictors showed a low R² value. The
number of predictors was lowest at levels 16 and 17. There was no apparent trend
between the number of predictors and mean squared error.

A plot of predictor variables versus their frequency of occurrence in all regression
models is shown in Figure 16. The last nine predictors have frequencies much greater than
the first 13 variables because they are available at several different pressure levels. The
predominant variables were temperature, mid and upper level wind and ageostrophic wind
components and the bulk Richardson number. A list of the top 10 predictors in
descending order of frequency is given in Table 14. The total wind, ageostrophic wind
and bulk Richardson number were expected to be predictors of Cn² as stated in Section 2.3.
The temperature was not expected to be a predictor based on the literature; however, based
on theory, it is logical that temperature would be included as a predictor of Cn² because
the refractive index depends on temperature for electromagnetic waves. The refractive index
is also a function of humidity, but near the stratosphere, the humidity can be considered
negligible. Contrary to expectation, the wind components at the tropopause did not
contribute to most models. This is probably due to the fact that the wind at the
tropopause is highly correlated with mid and upper level winds.


Table 12. F and p-values for latitude/longitude coordinates vs. averaged vertical level.

Table 13. Number of predictor variables in each regression model.

5.2 Conclusions

This study has shown that a linear relationship exists between the NCEP/NCAR
reanalysis database variables and the averaged logarithm of Cn² values measured at WSMR
for a discontinuous period between January 1993 and January 1994. The strongest
relationships were found closest to the radar site, as might be expected.
The most significant predictors in this study were temperature, total and ageostrophic
wind components and the bulk Richardson number. This study also showed that although
a linear relationship exists, it is weak, and several of the regression models are
complicated.

Based on the results, it was concluded that the exact relationship between optical
turbulence and synoptic scale variables could not be defined at this time; further research
is needed. Suggestions that may strengthen this relationship are provided next.


Figure 16. Predictor variable vs. frequency of occurrence in all regression models.


Table 14. Top 10 predominant predictor variables in descending order of frequency.

VARIABLE                     SYMBOL
Temperature                  T
Ageostrophic u-winds         UAG
U-winds                      U
Ageostrophic v-winds         VAG
Bulk Richardson number       BRN
V-winds                      V
Pressure vertical velocity   W
Relative humidity            RH
Geopotential height          H
Best (4-layer) index         B

5.3 Recommendations for Further Research


The results from this study showed that a linear relationship existed between averaged
optical turbulence values and synoptic scale variables, but it was a weak relationship. The
limited scope of this study did not permit exploration of other potential predictor variables
which may strengthen the linear relationship. First, categorizing the data by vertical levels
may not have been appropriate; mapping synoptic scale meteorological data to its
suspected source region and using these values as potential predictors may provide better
results. For example, vertical distance from the tropopause or maximum wind level and
horizontal distance from the jet core could be used as potential predictors. Second, a linear
regression using time derivatives of specific meteorological parameters may also show a
stronger relationship. Other potential predictors, such as the square of a predictor or the
product of two or more potential predictors, could also be used. In this case, if a strong
relationship were found, it would indicate a strong non-linear relationship.
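
The squared and product terms suggested above could be generated as in the following sketch. The function name and the sample values are hypothetical; the symbols T, U and BRN are taken from Table 14, and the values stand in for standardized predictors rather than actual reanalysis data.

```python
from itertools import combinations_with_replacement


def add_second_order_terms(row):
    """Return the row plus every square and pairwise product of its predictors."""
    expanded = dict(row)
    for a, b in combinations_with_replacement(sorted(row), 2):
        expanded[f"{a}*{b}"] = row[a] * row[b]
    return expanded


# Hypothetical standardized values for three of the Table 14 predictors.
sample = {"T": 2.0, "U": 3.0, "BRN": 0.5}
expanded = add_second_order_terms(sample)
print(expanded)
```

The expanded rows can then be offered to the same linear regression procedure; the model stays linear in its coefficients even though it is non-linear in the original predictors.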

In addition, a follow-on study should examine specific case studies that are available
in the data sets. Since a weak relationship was found for the discontinuous period
between January 1993 and January 1994, a stronger relationship may exist for certain
seasons of the year. It may be advantageous to focus on a specific feature, such as the
predominant predictor variables in Table 14 or the tropopause height, which migrates with
season. A linear or non-linear regression for specific periods of the year may provide a
stronger statistical relationship.

Although averaging optical turbulence data is common, it may not have been the best
choice. For example, time averaging from 3-minute increments to 6-hour increments may
have affected the relationship by smoothing out interesting features in the data. No
literature was found that addressed such a problem; however, a study relating Cn² to
synoptic scale weather parameters with different time and spatial increments may provide
better results.
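
The smoothing effect of time averaging can be illustrated with a small sketch. The series below is hypothetical, not radar data, and it assumes that the averaging is applied to the logarithms of Cn²: a quiet background of log10(Cn²) = -17 with a single 15-minute burst at -15.

```python
def block_means(values, block_size):
    """Average consecutive, non-overlapping blocks of `block_size` samples."""
    return [
        sum(values[i:i + block_size]) / block_size
        for i in range(0, len(values) - block_size + 1, block_size)
    ]


SAMPLES_PER_BLOCK = 6 * 60 // 3  # 120 three-minute samples per 6-hour block

# Hypothetical log10(Cn²) series: quiet background with one 15-minute burst.
series = [-17.0] * SAMPLES_PER_BLOCK
series[50:55] = [-15.0] * 5

peak = max(series)
six_hour_mean = block_means(series, SAMPLES_PER_BLOCK)[0]
print(f"3-minute peak: {peak}, 6-hour mean: {six_hour_mean:.3f}")
```

In this example a burst two orders of magnitude above the background shifts the 6-hour mean by less than 0.1, to about -16.92, so the feature is nearly invisible to a regression on the averaged values.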

Finally, synoptic scale variables may not fully explain the fluctuations in optical
turbulence. Since optical turbulence is usually confined to thin, horizontal layers, synoptic
scale variables may not be able to distinguish the fluctuations in Cn². Perhaps a study to
determine the statistical relationship between Cn² and mesoscale meteorological variables
may be more promising. Such a task will prove difficult, however, since data from
mesoscale models are not easily accessible and may not be available in areas where the
Airborne Laser will be deployed. Lastly, if the data are available, a study to determine the
statistical relationship between optical turbulence and local weather phenomena can be
pursued.